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BREAST. G ASTRTCaAEBypROSTATE CANCER ASSQCf ATFD 
ANTIGENS AND USES THEREFOR 

Field of the Invention 

The invention relates to nucleic acids and encoded polypeptides which are cancer 
associated antigens expressed in patients afflicted with breast, gastric or prostate cancer. The 
invention also relates to agents which bind the nucleic acids or polypeptides. The nucleic acid 
molecules, polypeptides coded for by such molecules and peptides derived therefrom, as well 
as related antibodies and cytolytic T lymphocytes, are useful, inter alia, in diagnostic and 
therapeutic contexts. 

Background of the Invention 

The mechanism by which T cells recognize foreign ipaterials has been implicated in 
cancer. A number of cytolytic T lymphocyte (CTL) clones directed against autologous 
melanoma antigens, testicular antigens, and melanocyte differentiation antigens have been 
described. In many instances, the antigens recognized by these clones have been 
characterized. 

The use of autologous CTLs for identifying tumor antigens requires that the target 
cells which express the antigens can be cultured in vitro and that stable lines of autologous 
CTL clones which recognize the antigen-expressing cells can be isolated and propagated. 
While this approach has worked well for melanoma antigens, other tumor types, such as 
epithelial cancers including breast and colon cancer, have proved refractory to the approach. 

More recently another approach to the problem has been described by Sahin et al. 
(Proc. Natl Acad Sci. USA 92:1 1810-1 1813, 1995). According to this approach, autologous 
antisera are used to identify immunogenic protein antigens expressed in cancer cells by 
screening expression libraries constructed from tumor cell cDNA. Antigen-encoding clones 
so identified have been found to elicit a high-titer humoral immune response in the patients 
from which the antisera were obtained. Such a high-titer IgG response implies helper T cell 
recognition of the detected antigen. These tumor antigens can then be screened for the 
presence of MHC/HLA class I and class II motifs and reactivity with CTLs. 

Since the individual tumor antigens presently known may be expressed only in a 
fraction of tumors, the availability of additional tumor antigens would significantly enlarge the 
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proportion of patients who are potentially eligible for therapeutic interventions. Thus there 
presently is a need for additional tumor antigens for development of therapeutics and 
diagnostics applicable to a greater number of cancer patients having various cancers. 
The invention is elaborated upon further in the disclosure which follows. 

5 

Summary of the Invention 

Autologous antibody screening has now been applied to breast, gastric and prostate 
cancer using antisera from cancer patients. Numerous cancer associated antigens have been 
identified. The invention provides, inter alia, isolated nucleic acid molecules, expression 

10 vectors containing those molecules and host cells transfected with those molecules. The 
invention also provides isolated proteins and peptides, antibodies to those proteins and 
peptides and CTLs which recognize the proteins and peptides. Fragments including 
functional fragments and variants of the foregoing also are provided. Kits containing the 
foregoing molecules additionally are provided. The foregoing can be used in the diagnosis, 

15 monitoring, research, or treatment of conditions characterized by the expression of one or 
more cancer associated antigens. 

Prior to the present invention, only a handful of cancer associated genes had been 
identified in the past 20 years. The invention involves the surprising discovery of several 
genes, some previously known and some previously unknown, which are expressed in 

20 individuals who have cancer. These individuals all have serum antibodies against the proteins 
(or fragments thereof) encoded by these genes. Thus, abnormally expressed genes are 
recognized by the host's immune system and therefore can form a basis for diagnosis, 
momtoring.and. therapy,. 

The invention involves the use of a single material, a plurality of different materials 

25 and even large panels and combinations of materials. For example, a single gene, a single 
protein encoded by a gene, a single functional fragment thereof, a single antibody thereto, etc. 
can be used in methods and products of the invention. Likewise, pairs, groups and even 
panels of these materials and optionally other cancer associated antigen genes and/or gene 
products can be used for diagnosis, monitoring and therapy. The pairs, groups or panels can 

30 involve 2, 3, 4, 5 or more genes, gene products, fragments thereof or agents that recognize 
such materials. A plurality of such materials are not only useful in monitoring, typing, 
characterizing and diagnosing cells abnormally expressing such genes, but a plurality of such 



WO 00/73801 PCT/US00/14749 

- 3 - 

materials can be used therapeutically. An example of the use of a plurality of such materials 
for the prevention, delay of onset, amelioration, etc. of cancer cells, which express or will 
express such genes prophylactically or acutely. Any and all combinations of the genes, gene 
products, and materials which recognize the genes and gene products can be tested and 
5 identified for use according to the invention. It would be far too lengthy to recite all such 
■ combinations; those skilled in the art, particularly in view of the teaching contained herein, 

j • will readily be able to determine which combinations are most appropriate for which 

circumstances. 

j As will be clear from the following discussion, the invention has in vivo and in vitro 

| 10 "ses. deluding for therapeutic, diagnostic, monitoring and research purposes. One aspect of 

| the invention is the ability to fingerprint a cell expressing a number of the genes identified 

| according to the invention by, for example, quantifying the expression of such gene products. 

Such fingerprints will be characteristic, for example, of the stage of the cancer, the type of the 
cancer, or even the effect in animal models of a therapy on a cancer. Cells also can be 
15 screened to determine whether such cells abnormally express the genes identified according to 
i the invention. 

The invention, in one aspect, is a method of diagnosing a disorder characterized by 
expression of a cancer associated antigen precursor coded for by a nucleic acid molecule. The 
method involves the steps of contacting a biological sample isolated from a subject with an 
20 agent that specifically binds to the nucleic acid molecule, an expression product thereof, or a 
fragment of an expression product thereof complexed with an MHC, preferably an HLA, 
molecule, wherein the nucleic acid molecule is a NA Group 1 nucleic acid molecule, and 
determining the interaction between the agent and the nucleic acid molecule, the expression 
product or fragment of the expression product as a determination of the disorder. 
25 In one embodiment the agent is selected from the group consisting of (a) a nucleic 

acid molecule comprising NA Group 1 nucleic acid molecules or a fragment thereof, (b) a 
nucleic acid molecule comprising NA Group 3 nucleic acid molecules or a fragment thereof, 
(c) a nucleic acid molecule comprising NA Group 5 nucleic acid molecules or a fragment 
thereof, (d) an antibody that binds to an expression product, or a fragment thereof, of NA 
30 group 1 nucleic acids, (e) an antibody that binds to an expression product, or a fragment 
thereof, of NA group 3 nucleic acids, (f) an antibody that binds to an expression product, or a 
fragment thereof, of NA group 5 nucleic acids, (g) and agent that binds to a complex of an 
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MHC, preferably HLA, molecule and a fragment of an expression product of a NA Group 1 
nucleic acid, (h) an agent that binds to a complex of an MHC, preferably HLA, molecule and 
a fragment of an expression product of a NA group 3 nucleic acid, and (i) an agent that binds 
to a complex of an MHC, preferably HLA, molecule and a fragment of an expression product 

5 of a NA Group 5 nucleic acid. 

The disorder maybe characterized by expression of a plurality of cancer associated 
antigen precursors. Thus the methods of diagnosis may include use of a plurality of agents, 
each of which is specific for a different human cancer associated antigen precursor (including 
at least one of the cancer associated antigen precursors disclosed herein), and wherein said 

10 plurality of agents is at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at 
least 9 or at least 10 such agents. Any of the diagnostic methods disclosed herein can be 
applied sequentially over time to permit determination of the prognosis or progression (or 
regression) of the disorder. 

In each of the above embodiments the agent may be specific for a human cancer 

15 associated antigen precursor, including the breast, gastric and prostate cancer associated 
antigen precursors disclosed herein. 

In another aspect the invention is a method for detennining regression, progression or 
onset of a condition characterized by expression of abnormal levels of a protein encoded by a 
nucleic acid molecule that is a NA Group 1 molecule. The method involves the steps of 

20 monitoring a sample, from a subject who has or is suspected of having the condition, for a 
parameter selected from the group consisting of (i) the protein, (ii) a peptide derived from the 
protein, (iii) an antibody which selectively binds the protein or peptide, and (iv) cytolytic T 
cells specific for a complex of the peptide derived from- the protein and an MHC molecule, as 
a determination of regression, progression or onset of said condition. In one embodiment the 

25 sample is a body fluid, a body effusion or a tissue. 

In another embodiment the step of monitoring comprises contacting the sample with a 
detectable agent selected from the group consisting of (a) an antibody which selectively binds 
the protein of (i), or the peptide of (ii), (b) a protein or peptide which binds the antibody of 
(iii), and (c) a cell which presents the complex of the peptide and MHC molecule of (iv). In a 

30 preferred embodiment the antibody, the protein, the peptide or the cell is labeled with a 
radioactive label or an enzyme. The sample in a preferred embodiment is assayed for the 
peptide. Preferably samples are isolated from tissue or bodily fluids of the subject at 
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sequential time points, and the samples are assayed as a determination of the regression, 
progression or onset of the condition from a first sequential time point to a second sequential 
time point. 

According to another embodiment the nucleic acid molecule is one of the following: a 
NA Group 3 molecule or a NA Group 5 molecule. In yet another embodiment the protein is a 
plurality of proteins, the parameter is a plurality of parameters, each of the plurality of 
parameters being specific for a different one of the plurality of proteins. 

The invention in another aspect is a pharmaceutical preparation for a human subject. 
The pharmaceutical preparation includes an agent which when administered to the subject 
enriches selectively the presence of complexes of an HLA molecule and a human cancer 
associated antigen, and a phannaceutically acceptable carrier, wherein the human cancer 
associated antigen is a fragment of a human cancer associated antigen precursor encoded by a 
nucleic acid molecule which comprises a NA Group 1 molecule. In one embodiment the 
nucleic acid molecule is a NA Group 3 nucleic acid molecule. 

The agent in one embodiment comprises a plurality of agents, each of which enriches 
selectively in the subject complexes of an HLA molecule and a different human cancer 
associated antigen. Preferably the plurality is at least two, at least three, at least four or at 
least 5 different such agents. 

In another embodiment the agent is selected from the group consisting of (1) an 
isolated polypeptide comprising the human cancer associated antigen, or a functional variant 
thereof, (2) an isolated nucleic acid operably linked to a promoter for expressing the isolated 
polypeptide, or functional variant thereof, (3) a host cell expressing the isolated polypeptide, 
or functional variant thereof, and (4) isolated complexes of the polypeptide, or functional 
variants thereof, and an HLA molecule. 

The agent may be a cell expressing an isolated polypeptide. In one embodiment the 
agent is a cell expressing an isolated polypeptide comprising the human cancer associated 
antigen or a functional variant thereof. In another embodiment the agent is a cell expressing 
an isolated polypeptide comprising the human cancer associated antigen or a functional 
variant thereof, and wherein the cell expresses an HLA molecule that binds the polypeptide. 
The cell can express one or both of the polypeptide and HLA molecule recombinantly. In 
preferred embodiments the cell is nonproliferative. In yet another embodiment the agent is at 
least two, at least three, at least four or at least five different polypeptides, each representing a 
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different human cancer associated antigen or functional variant thereof. 

The agent in one embodiment is a PP Group 2 polypeptide. In other embodiments the 
agent is a PP Group 3 polypeptide or a PP Group 4 polypeptide. 

Li an embodiment each of the pharmaceutical preparations described herein also 
includes an adjuvant 

According to another aspect the invention, a composition is provided which includes 
an isolated agent that binds selectively a PP Group 1 polypeptide. In separate embodiments 
the agent binds selectively to a polypeptide selected from the following: a PP Group 2 
polypeptide, a PP Group 3 polypeptide, a PP Group 4 polypeptide, and a PP Group 5 
polypeptide. In other embodiments, the agent is a plurality of different agents that bind 
selectively at least two, at least three, at least four, or at least five different such polypeptides. 
In each of the above described embodiments the agent may be an antibody. 

In another aspect the invention is a composition of matter composed of a conjugate of 
the agent of the above-described compositions of the invention and a therapeutic or diagnostic 
agent. Preferably the conjugate is of the agent and a therapeutic or diagnostic that is an 
antineoplastic. 

The invention in another aspect is a pharmaceutical composition which includes an 
isolated nucleic acid molecule selected from the group consisting of: (1) NA Group 1 
molecules, and (2) NA Group 2 molecules, and a pharmaceutically acceptable carrier. In one 
embodiment the isolated nucleic acid molecule comprises a NA Group 3 or NA Group 4 
molecule. In another embodiment the isolated nucleic acid molecule comprises at least two 
isolated nucleic acid molecules coding for two different polypeptides, each polypeptide 
comprising a different cancer associated antigen. 

Preferably the pharmaceutical composition also includes an expression vector with a 
promoter operably linked to the isolated nucleic acid molecule. In another embodiment the 
pharmaceutical composition also includes a host cell recombinantly expressing the isolated 

nucleic acid molecule. 

According to another aspect of the invention a pharmaceutical composition is 
provided. The pharmaceutical composition includes an isolated polypeptide comprising a PP 
Group 1 or a PP Group 2 polypeptide, and a pharmaceutically acceptable carrier. In one 
embodiment the isolated polypeptide comprises a PP Group 3 or a PP Group 4 polypeptide. 

In another embodiment the isolated polypeptide comprises at least two different 
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polypeptides, each comprising a different cancer associated antigen at least one of which is 
encoded by aNA group 1 molecule as disclosed herein. In separate embodiments the isolated 
polypeptides are selected from the following: breast cancer polypeptides or HLA binding 
fragments thereof and gastric cancer polypeptides or HLA binding fragments thereof. 
5 In an embodiment each of the pharmaceutical compositions described herein also 

includes an adjuvant 

Another aspect the invention is an isolated nucleic acid molecule comprising a NA 
Group 3 molecule. Another aspect the invention is an isolated nucleic acid molecule 
comprising a NA Group 4 molecule. 
10 The invention in another aspect is an isolated nucleic acid molecule selected from the 

group consisting of (a) a fragment of a nucleic acid selected from the group of nucleic acid 
molecules consisting of SEQ ID Nos:l-593, of sufficient length to represent a sequence 
unique within the human genome, and identifying a nucleic acid encoding a human cancer 
associated antigen precursor, (b) complements of (a), provided that the fragment includes a 
1 5 sequence of contiguous nucleotides which is not identical to any sequence selected from the 
sequence group consisting of (1) sequences having the GenBank accession numbers of Table 
1 and other sequences publicly available as of the filing date of this application, (2) 
complements of (1), and (3) fragments of (1) and (2). Preferably the unique fragments are 
fragments of a nucleic acid selected from the group of nucleic acid molecules consisting of 
20 SEQ ID NOs:12, 15, 34-59, 61, 62, 83-95, 186, 190-205, 297, 327-332, and 335-352. 

In one embodiment the sequence of contiguous nucleotides is selected from the group 
consisting of: (1) at least two contiguous nucleotides nonidentical to the sequences in Table 1, 
(2) at least three contiguous nucleotides nonidentical to the sequences in Table 1, (3) at least 
four contiguous nucleotides nonidentical to the sequences in Table 1, (4) at least five 
25 contiguous nucleotides nonidentical to the sequences in Table 1 , (5) at least six contiguous 
nucleotides nonidentical to the sequences in Table 1, or (6) at least seven contiguous 
nucleotides nonidentical to the sequences in Table 1. 

In another embodiment the fragment has a size selected from the group consisting of at 
least: 8 nucleotides, 10 nucleotides, 12 nucleotides, 14 nucleotides, 16 nucleotides, 18 
30 nucleotides, 20, nucleotides, 22 nucleotides, 24 nucleotides, 26 nucleotides, 28 nucleotides, 
30 nucleotides, 50 nucleotides, 75 nucleotides, 100 nucleotides, 200 nucleotides, 1000 
nucleotides and every integer length therebetween. 
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In yet another embodiment the molecule encodes a polypeptide which, or a fragment 
of which, binds a human HLA receptor (e.g., class I or class II) or a human antibody. 

Another aspect of the invention is an expression vector comprising an isolated nucleic 
acid molecule of the invention described above operably linked to a promoter. 
5 According to one aspect the invention is an expression vector comprising a nucleic 

acid operably linked to a promoter, wherein the nucleic acid is a NA Group 1 or Group 2 
molecule. In another aspect the invention is an expression vector comprising a NA Group 1 
or Group 2 molecule and a nucleic acid encoding an MHC, preferably HLA, molecule. 

In yet another aspect the invention is a host cell transformed or transfected with an 
10 expression vector of the invention described above. 

In another aspect the invention is a host cell transformed or transfected with an 
expression vector comprising an isolated nucleic acid molecule of the invention described 
above operably linked to a promoter, or an expression vector comprising a nucleic acid 
operably linked to a promoter, wherein the nucleic acid is a NA Group 1 or 2 molecule and 
15 further comprising a nucleic acid encoding HLA. 

According to another aspect of the invention an isolated polypeptide encoded by the 
isolated nucleic acid molecules of the invention, described above, is provided. These include 
PP Group 1-5 polypeptides. The invention also includes a fragment of the polypeptide which 
is immunogenic. In one embodiment the fragment, or a portion of the fragment, binds HLA 

20 or a human antibody. 

The invention includes in another aspect an isolated fragment of a human cancer 
associated antigen precursor which, or a portion of which, binds HLA or a human antibody, 
wherein the precursor is encoded by a nucleic acid molecule that is a NA Group 1 molecule. 
In one embodiment the fragment is part of a complex with HLA. In another embodiment the 

25 fragment is between 8 and 12 amino acids in length. In another embodiment the invention 
includes an isolated polypeptide comprising a fragment of the polypeptide of sufficient length 
to represent a sequence unique within the human genome and identifying a polypeptide that is 
a human cancer associated antigen precursor. 

According to another aspect of the invention a kit for detecting the presence of the 

30 expression of a cancer associated antigen precursor is provided. The kit includes a pair of 
isolated nucleic acid molecules each of which consists essentially of a molecule selected from 
the group consisting of (a) a 12-32 nucleotide contiguous segment of the nucleotide sequence 
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of any of the NA Group 1 molecules and (b) complements of (a), wherein the contiguous 
segments are nonoverlapping. In one embodiment the pair of isolated nucleic acid molecules 
is constructed and arranged to selectively amplify an isolated nucleic acid molecule that is a 
NA Group 3 molecule. Preferably, the pair amplifies a human NA Group 3 molecule. 
5 According to another aspect of the invention a method for treating a subject with a 

disorder characterized by expression of a human cancer associated antigen precursor is 
provided. The method includes the step of administering to the subject an amount of an agent, 
which enriches selectively in the subject the presence of complexes of an HLA molecule and a 
human cancer associated antigen, effective to ameliorate the disorder, wherein the human 
) cancer associated antigen is a fragment of a human cancer associated antigen precursor 
encoded by a nucleic acid molecule selected from the group consisting of (a) a nucleic acid 
molecule comprising NA group 1 nucleic acid molecules, (b) a nucleic acid molecule 
comprising NA group 3 nucleic acid molecules, (c) a nucleic acid molecule comprising NA 
group 5 nucleic acid molecules. 

In one embodiment the disorder is characterized by expression of a plurality of human 
cancer associated antigen precursors and wherein the agent is a plurality of agents, each of 
which enriches selectively in the subject the presence of complexes of an HLA molecule and a 
different human cancer associated antigen. Preferably the plurality is at least 2, at least 3, at 
least 4, or at least 5 such agents. 

In another embodiment the agent is an isolated polypeptide selected from the group 
consisting of PP Group 1, PP Group 2, PP Group 3, PP Group 4, and PP group 5 polypeptides. 
In yet another embodiment the disorder is cancer. 

According to another aspect the invention is a method for treating a subject having a 
condition characterized by expression of a cancer associated antigen precursor in cells of the 
subject The method includes the steps of (i) removing an immunoreactive cell containing 
sample from the subject, (ii) contacting the immunoreactive cell containing sample to the host 
cell under conditions favoring production of cytolytic T cells against a human cancer 
associated antigen which is a fragment of the precursor, (iii) introducing the cytolytic T cells 
to the subject in an amount effective to lyse cells which express the human cancer associated 
antigen, wherein the host cell is transformed or transfected with an expression vector 
comprising an isolated nucleic acid molecule operably linked to a promoter, the isolated 
nucleic acid molecule being selected from the group of nucleic acid molecules consisting of 
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NA Group 1, NA Group 2, NA Group 3, NA Group 4, NA Group 5. 

In one embodiment the host cell recombinantly expresses an HLA molecule which 
binds the human cancer associated antigen. In another embodiment the host cell 
endogenously expresses an HLA molecule which binds the human cancer associated antigen. 

5 The invention includes in another aspect a method for treating a subject having a 

condition characterized by expression of a cancer associated antigen precursor in cells of the 
subject. The method includes the steps of (i) identifying a nucleic acid molecule expressed by 
the cells associated with said condition, wherein said nucleic acid molecule is a NA Group 1 
molecule (ii) transfecting a host cell with a nucleic acid molecule selected from the group 

10 consisting of (a) the nucleic acid molecule identified, (b) a fragment of the nucleic acid 
molecule identified which includes a segment coding for a cancer associated antigen, (c) 
deletions, substitutions or additions to (a) or (b), and (d) degenerates of (a), (b), or (c); (iii) 
culturing said transfected host cells to express the transfected nucleic acid molecule, and; (iv) 
introducing an amount of said host cells or an extract thereof to the subject effective to 

1 5 increase an immune response against the cells of the subject associated with the condition. 
Preferably, the antigen is a human antigen and the subject is a human. 

In one embodiment the method also includes the step of (a) identifying an MHC 
molecule which presents a portion of an expression product of the nucleic acid molecule, 
wherein the host cell expresses the same MHC molecule as identified in (a) and wherein the 

20 host cell presents an MHC binding portion of the expression product of the nucleic acid 
molecule. 

In another embodiment the method also includes the step of treating the host cells to 
render them^»on-proliferative.-- 

In yet another embodiment the immune response comprises a B-cell response or a T 
25 cell response. Preferably the response is a T-cell response which comprises generation of 
cytolytic T-cells specific for the host cells presenting the portion of the expression product of 
the nucleic acid molecule or cells of the subject expressing the human cancer associated 
antigen. 

In another embodiment the nucleic acid molecule is a NA Group 3 molecule. 
30 Another aspect of the invention is a method for treating or diagnosing or monitoring a 

subject having a condition characterized by expression of an abnormal amount of a protein 
encoded by a nucleic acid molecule that is a NA Group 1 molecule. The method includes the 
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step of administering to the subject an antibody which specifically binds to the protein or a 
peptide derived therefrom, the antibody being coupled to a therapeutically useful agent, in an 
amount effective to treat the condition. 

In one embodiment the antibody is a monoclonal antibody. Preferably the monoclonal 
5 antibody is a chimeric antibody or a humanized antibody. 

In another aspect the invention is a method for treating a condition characterized by 
expression in a subject of abnormal amounts of a protein encoded by a nucleic acid molecule 
that is a NA Group 1 nucleic acid molecule. The method involves the step of administering to 
a subject at least one of the pharmaceutical compositions of the invention described above in 
10 an amount effective to prevent, delay the onset of, or inhibit the condition in the subject. In 
one embodiment the condition is cancer. In another embodiment the method includes the step 
of first identifying that the subject expresses in a tissue abnormal amounts of the protein. 

The invention in another aspect is a method for treating a subject having a condition 
characterized by expression of abnormal amounts of a protein encoded by a nucleic acid 
15 molecule that is a NA Group 1 nucleic acid molecule. The method includes the steps of (i) 
identifying cells from the subject which express abnormal amounts of the protein; (ii) 
isolating a sample of the cells; (iii) cultivating the cells, and (iv) introducing the cells to the 
subject in an amount effective to provoke an immune response against the cells. 

In one embodiment the method includes the step of rendering the cells non- 
20 proliferative, prior to introducing them to the subject 

In another aspect the invention is a method for treating a pathological cell condition 
characterized by abnormal expression of a protein encoded by a nucleic acid molecule that is a 
NA Group 1 nucleic acid molecule. The method includes the step of administering to a 
subject in need thereof an effective amount of an agent which inhibits the expression or 
25 activity of the protein. 

In one embodiment the agent is an inhibiting antibody which selectively binds to the 
protein and wherein the antibody is a monoclonal antibody, a chimeric antibody, a humanized 
antibody or a fragment thereof. In another embodiment the agent is an antisense nucleic acid 
molecule which selectively binds to the nucleic acid molecule which encodes the protein. In 
30 yet another important embodiment the nucleic acid molecule is a NA Group 3 nucleic acid 
molecule. 

The invention includes in another aspect a composition of matter useful in stimulating 
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an inunune response to a plurality of proteins encoded by nucleic acid molecules that are NA 
Group 1 molecules. The composition is a plurality of peptides derived from the amino acid 
sequences of the proteins, wherein the peptides bind to one or more MHC molecules 
presented on the surface of the cells which express an abnormal amount of the protein. 
5 In one embodiment at least a portion of the plurality of peptides bind to MHC 

molecules and elicit a cytolytic response thereto. In another embodiment the composition of 
matter includes an adjuvant. In another embodiment the adjuvant is a saponin, GM-CSF, or 
an interleukin. In still another embodiment, the compositions also includes at least one 
peptide useful in stimulating an immune response to at least one protein which is not encoded 
10 by nucleic acid molecules that are NA Group 1 molecules, wherein the at least one peptide 
binds to one or more MHC molecules. 

According to another aspect the invention is an isolated antibody which selectively 
binds to a complex of: (i) a peptide derived from a protein encoded by a nucleic acid molecule 
that is a NA Group 1 molecule and (ii) and an MHC molecule to which binds the peptide to 
15 form the complex, wherein the isolated antibody does not bind to (i) or (ii) alone. 

In one embodiment the antibody is a monoclonal antibody, a chimeric antibody, a 
humanized antibody or a fragment thereof. 

The invention also involves the use of the genes, gene products, fragments thereof, 
agents which bind thereto, and so on in the preparation of medicaments. A particular 
20 medicament is for treating cancers including, e.g., one or more of cancers of the breast, cervix, 
ovary, prostate, testis, lung, colon, pancreas, stomach, liver, skin (e.g., melanoma), bladder, 
head and neck, thyroid, blood cells, bone and kidney. Diagnostics for specific cancers and 
groups of cancets also arcenvisioned- 

In certain preferred embodiment, the nucleic acid molecules are selected from the 
25 group consisting of SEQ ID NOs: 1-18, and the polypeptides are encoded by these preferred 
nucleic acid molecules. 

Still other embodiments and aspects of the invention will become apparent in 
connection with the description of the invention which follows. 

30 Detailed Description of the Invention 

In the above summary and in the ensuing description, lists of sequences are provided. 
The lists are meant to embrace each single sequence separately, two or more sequences 



RMS na 



WO 00/73801 PCT/US00/14749 

-13- 

together where they form a part of the same gene, any combination of two or more sequences 
which relate to different genes, including and up to the total number on the list, as if each and 
every combination were separately and specifically enumerated. Likewise, when mentioning 
fragment size, it is intended that a range embrace the smallest fragment mentioned to the full- 
5 length of the sequence (less one nucleotide or amino acid so that it is a ftagment), each and 
every fragment length intended as if specifically enumerated. Thus, if a fragment could be 
between 10 and 15 in length, it is explicitly meant to mean 10, 1 1, 12, 13, 14, or 15 in length. 
The summary and the claims mention antigen precursors and antigens. As used in the 
summary and in the claims, a precursor is substantially the full-length protein encoded by the 
10 coding region of the isolated DNA and the antigen is a peptide which complexes with MHC, 
preferably HLA, and which participates in the immune response as part of that complex. Such 
antigens are typically 9 amino acids long, although this may vary slightly. 

As used herein, a subject is a human, non-human primate, cow, horse, pig, sheep, goat, 
dog, cat or rodent. In all embodiments human cancer antigens and human subjects are 
15 preferred. 

The present invention in one aspect involves the cloning of cDNAs encoding human 
cancer associated antigen precursors using autologous antisera of subjects having breast, 
gastric or prostate cancer. The sequences of the clones representing genes identified 
according to the methods described herein are presented in the attached Sequence Listing. Of 
the foregoing, it can be seen that some of the clones are considered completely novel as no 
coding regions were found in the databases searched. Other clones are novel but have some 
nucleotide or amino acid homologies to sequences deposited in databases (mainly EST 
sequences). Nevertheless, the entire gene sequence was not previously known. In some cases 
no function was suspected and in other cases, even if a function was suspected, it was not 
known that the gene was associated with cancer, or with a particular cancer. In all cases, it 
was not known or suspected that the gene encoded a cancer antigen which reacted with an 
antibody from autologous sera. Analysis of the clone sequences by comparison to nucleic 
acid and protein databases determined that still other of the clones surprisingly are closely 
related to other previously-cloned genes. The sequences of these related genes is also 
presented in the Sequence Listing. The nature of the foregoing genes as encoding antigens 
recognized by the immune systems of cancer patients is, of course, unexpected. 

The invention thus involves in one aspect cancer associated antigen polypeptides, 
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genes encoding those polypeptides, functional modifications and variants of the foregoing, 
userul fragments of the foregoing, as well as diagnostics and therapeutics relating thereto. 

Homologs and alleles of the cancer associated antigen nucleic acids of the invention 
can be identified by conventional techniques. Thus, an aspect of the invention is those nucleic 
5 acid sequences which code for cancer associated antigen precursors. Because this application 
contains so many sequences, the following chart is provided to identify the various groups of 
sequences discussed in the claims and in the summary: 

Nucleic Acid Sequences 
10 NA Group 1. (a) nucleic acid molecules which hybridize under stringent conditions to a 
molecule consisting of a nucleic acid sequence selected from the group consisting of nucleic 
acid sequences among SEQ ID NOs: 1-593, and which code for a cancer associated antigen 
precursor, 

(b) deletions, additions and substitutions which code for a respective cancer 

15 associated antigen precursor, 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) or 
(b) in codon sequence due to the degeneracy of the genetic code, and 

(d) complements of (a), (b) or (c). 

20 NA Group 2. Fragments of NA Group 1, which code for a polypeptide which, or a portion of 
which, binds an MHC molecule to form a complex recognized by an autologous antibody or 
lymphocyte. 

NA Group 3. The subset of NA Group 1 where the nucleotide sequence is selected from the 

25 group consisting of: 

(a) previously unknown human nucleic acids coding for a human cancer 
associated antigen precursor, e.g., SEQ ID NOs:12, 15, 34-59, 61, 62, 83-95, 1 86, 190-205, 
297, 327-332, and 335-352, 

(b) deletions, additions and substitutions which code for a respective human 

30 cancer associated antigen precursor, 

(c) nucleic acid molecules that differ from the nucleic acid molecules of (a) or 
(b) in codon sequence due to the degeneracy of the genetic code, and 
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(d) complements of (a), (b) or (c). 

NA Group 4. Fragments of NA Group 3, which code for a polypeptide which, or a portion of 
which, binds to an MHC molecule to form a complex recognized by an autologous antibody 
or lymphocyte. 



NA Group 5. A subset of NA Group 1, comprising human cancer associated antigens that 
react with allogeneic cancer antisera. 



Polypeptide Sequences 
PP Group 1 , Polypeptides encoded by NA Group 1 . 
PP Group 2. Polypeptides encoded by NA Group 2. 
PP Group 3. Polypeptides encoded by NA Group 3. 
PP Group 4. Polypeptides encoded by NA Group 4, 
PP Group 5. Polypeptides encoded by NA Group 5. 



Particularly preferred polypeptides are those recognized by allogeneic sera of cancer 
patients, but not by non-cancer patient control sera. For example, as shown in the Examples 
below, polypeptides encoded by SEQ ID NOs:l-18 are recognized only by antibodies in 
cancer patients antisera. 

The term "stringent conditions" as used herein refers to parameters with which the art 
is familiar. Nucleic acid hybridization parameters may be found in references which compile 
such methods, e.g. Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., 
Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, 
or Current Protocols in Molecular Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, 
Inc., New York. More specifically, stringent conditions, as used herein, refers, for example, 
to hybridization at 65°C in hybridization buffer (3.5 x SSC, 0.02% Ficoll, 0.02% polyvinyl 
pyrrolidone, 0.02% Bovine Serum Albumin, 2.5 mM NaH 2 P0 4 (pH7), 0.5% SDS, 2 mM 
EDTA). SSC is 0.15 M sodium chloride/0.15 M sodium citrate, pH7; SDS is sodium dodecyl 
sulphate; and EDTA is ethylenediaminetetracetic acid. After hybridization, the membrane 
upon which the DNA is transferred is washed, for example, in 2 x SSC at room temperature 
and then at 0.1 - 0.5 x SSC/0.1 x SDS at temperatures up to 68°C. 
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There are other conditions, reagents, and so forth which can be used, which result in a 
similar degree of stringency. The skilled artisan will be familiar with such conditions, and 
thus they are not given here. It will be understood, however, that the skilled artisan will be 
able to manipulate the conditions in a manner to permit the clear identification of homologs 

5 and alleles of cancer associated antigen nucleic acids of the invention (e.g., by using lower 
stringency conditions). The skilled artisan also is familiar with the methodology for screening 
cells and libraries for expression of such molecules which then are routinely isolated, 
followed by isolation of the pertinent nucleic acid molecule and sequencing. 

In general homologs and alleles typically will share at least 80% nucleotide identity 

10 and/or at least 90% amino acid identity to the sequences of cancer associated antigen nucleic 
acid and polypeptides, respectively, in some instances wiU share at least 90% nucleotide 
identity and/or at least 95% amino acid identity and in still other instances will share at least 
95% nucleotide identity and/or at least 99% amino acid identity. The homology can be 
calculated using various, publicly available software tools developed by NCBI (Bethesda, 

15 Maryland) that can be obtained through the Internet (ftp:mcbi.nlm.nih.gov/pub/). Exemplary 
tools include the BLAST system available at http^/www.ncbi.nlmJiih.gov, preferably using 
default settings. Pairwise and ClustalW alignments (BLOSUM30 matrix setting) as well as 
Kyle-Doolittle hydropathic analysis can be obtained using the MacVector sequence analysis 
software (Oxford Molecular Group). Watson-Crick complements of the foregoing nucleic 

20 acids also are embraced by the invention. 

In screening for cancer associated antigen genes, a Southern blot may be performed 
using the foregoing conditions, together with a radioactive probe. After washing the 
membrane. to .which the DNA is finally transferred, the membrane canhe placedagainst X-ray 
film to detect the radioactive signal. In screening for the expression of cancer associated 

25 antigen nucleic acids, Northern blot hybridizations using the foregoing conditions can be 
performed on samples taken from breast, gastric or prostate cancer patients or subjects 
suspected of having a condition characterized by expression of the cancer associated antigen 
genes disclosed herein. Amplification protocols such as polymerase chain reaction using 
primers which hybridize to the sequences presented also can be used for detection of the 

30 cancer associated antigen genes or expression thereof. 

The breast, gastric and prostate cancer associated genes correspond to SEQ ID Nos:l- 
593. These sequences represent genes previously known in humans and genes previously 
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unknown in humans (e.g., SEQ ID NOs:12, 15, 34-59, 61, 62, 83-95, 186, 190-205, 297, 327- 
332, and 335-352). Preferred breast, gastric and prostate cancer associated antigens for the 
methods of diagnosis disclosed herein are those which encode polypeptides that react with 
allogeneic cancer antisera (i.e. NA Group 5). Encoded polypeptides (e.g., proteins), peptides 
5 and antisera thereto are also preferred for diagnosis. 

As used herein with respect to nucleic acids, the term "isolated" means: (i) amplified 
in vitro by, for example, polymerase chain reaction (PCR); (ii) recombinant^ produced by 
cloning; (iii) purified, as by cleavage and gel separation; or (iv) synthesized by, for example, 
chemical synthesis. An isolated nucleic acid is one which is readily manipulable by 
10 recombinant DNA techniques well known in the art. Thus, a nucleotide sequence contained 
in a vector in which 5' and 3' restriction sites are known or for which polymerase chain 
reaction (PCR) primer sequences have been disclosed is considered isolated but a nucleic acid 
sequence existing in its native state in its natural host is not. An isolated nucleic acid may be 
substantially purified, but need not be. For example, a nucleic acid that is isolated within a 
15 cloning or expression vector is not pure in that it may comprise only a tiny percentage of the 
material in the cell in which it resides. Such a nucleic acid is isolated, however, as the term is 
used herein because it is readily manipulable by standard techniques known to those of 
ordinary skill in the art. An isolated nucleic acid as used herein is not a naturally occurring 
chromosome. 

20 As used herein with respect to polypeptides, "isolated" means separated from its native 

environment and present in sufficient quantity to permit its identification or use. Isolated, 
when referring to a protein or polypeptide, means, for example: (i) selectively produced by 
expression cloning or (ii) purified as by chromatography or electrophoresis. Isolated proteins 
or polypeptides may be, but need not be, substantially pure. The term "substantially pure" 
25 means that the proteins or polypeptides are essentially free of other substances with which 
they may be found in nature or in vivo systems to an extent practical and appropriate for their 
intended use. Substantially pure polypeptides may be produced by techniques well known in 
the art. Because an isolated protein may be admixed with a pharmaceutically acceptable 
carrier in a pharmaceutical preparation, the protein may comprise only a small percentage by 
30 weight of the preparation. The protein is nonetheless isolated in that it has been separated 
from the substances with which it may be associated in living systems, i.e. isolated from other 
proteins. 
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The invention also includes degenerate nucleic acids which include alternative codons 
to those present in the native materials. For example, serine residues are encoded by the 
codons TCA, AGT, TCC, TCG, TCT and AGC. Each of the six codons is equivalent for the 
purposes of encoding a serine residue. Thus, it will be apparent to one of ordinary skill in the 
art that any of the serine-encoding nucleotide triplets may be employed to direct the protein 
synthesis apparatus, in vitro or in vivo, to incorporate a serine residue into an elongating 
cancer associated antigen polypeptide. Similarly, nucleotide sequence triplets which encode 
other amino acid residues include, but are not limited to: CCA, CCC, CCG and CCT (proline 
codons); CGA, CGC, CGG, CGT, AGA and AGG (arginine codons); ACA, ACC, ACG and 
ACT (threonine codons); AAC and AAT (asparagine codons); and ATA, ATC and ATT 
(isoleucine codons). Other amino acid residues may be encoded similarly by multiple 
nucleotide sequences. Thus, the invention embraces degenerate nucleic acids that differ from 
the biologically isolated nucleic acids in codon sequence due to the degeneracy of the genetic 
code. 

The invention also provides modified nucleic acid molecules which include additions, 
substitutions and deletions of one or more nucleotides. In preferred embodiments, these 
modified nucleic acid molecules and/or the polypeptides they encode retain at least one 
activity or function of the unmodified nucleic acid molecule and/or the polypeptides, such as 
antigenicity, enzymatic activity, receptor binding, formation of complexes by binding of 
peptides by MHC class I and class II molecules, etc. In certain embodiments, the modified 
nucleic acid molecules encode modified polypeptides, preferably polypeptides having 
conservative amino acid substitutions as are described elsewhere herein. The modified 
nucleic acid molecules are structurally related to the unmodified nucleic acid molecules and in 
preferred embodiments are sufficiently structurally related to the unmodified nucleic acid 
molecules so that the modified and unmodified nucleic acid molecules hybridize under 
stringent conditions known to one of skill in the art. 

For example, modified nucleic acid molecules which encode polypeptides having 
single amino acid changes can be prepared. Each of these nucleic acid molecules can have 
one, two or three nucleotide substitutions exclusive of nucleotide changes corresponding to 
the degeneracy of the genetic code as described herein. Likewise, modified nucleic acid 
molecules which encode polypeptides having two amino acid changes can be prepared which 
have, e.g., 2-6 nucleotide changes. Numerous modified nucleic acid molecules like these will 
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be readily envisioned by one of skill in the art, including for example, substitutions of 
nucleotides in codons encoding amino acids 2 and 3, 2 and 4, 2 and 5, 2 and 6, and so on. In 
the foregoing example, each combination of two amino acids is included in the set of 
modified nucleic acid molecules, as well as all nucleotide substitutions which code for the 
5 amino acid substitutions. Additional nucleic acid molecules that encode polypeptides having 
additional substitutions (i.e., 3 or more), additions or deletions (e.g., by introduction of a stop 
codon or a splice site(s)) also can be prepared and are embraced by the invention as readily 
envisioned by one of ordinary skill in the art. Any of the foregoing nucleic acids or 
polypeptides can be tested by routine experimentation for retention of structural relation or 
10 activity to the nucleic acids and/or polypeptides disclosed herein. 

The invention also provides isolated unique fragments of cancer associated antigen 
nucleic acid sequences or complements thereof. A unique fragment is one that is a 'signature' 
for the larger nucleic acid. It, for example, is long enough to assure that its precise sequence 
is not found in molecules within the human genome outside of the cancer associated antigen 
nucleic acids defined above (and human alleles). Those of ordinary skill in the art may apply 
no more than routine procedures to determine if a fragment is unique within the human 
genome. Unique fragments, however, exclude fragments completely composed of the 
nucleotide sequences of any of the GenBank accession numbers listed in Table 1 or other 
previously pubUshed sequences as of the filing date of the priority documents for sequences 
listed in a respective priority document or the filing date of this application for sequences 
listed for the first time in this application which overlap the sequences of the invention. 

A fragment which is completely composed of the sequence described in the foregoing 
GenBank deposits is one which does not include any of the nucleotides unique to the 
sequences of the invention. Thus, a unique fragment must contain a nucleotide sequence 
other than the exact sequence of those in GenBank or fragments thereof. The difference may 
be an addition, deletion or substitution with respect to the GenBank sequence or it may be a 
sequence wholly separate from the GenBank sequence. 

Unique fragments can be used as probes in Southern and Northern blot assays to 
identify such nucleic acids, or can be used in amplification assays such as those employing 
PCR. As known to those skilled in the art, large probes such as 200, 250, 300 or more 
nucleotides are preferred for certain uses such as Southern and Northern blots, while smaller 
fragments will be preferred for uses such as PCR. Unique fragments also can be used to 
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produce fusion proteins for generating antibodies or determining binding of the polypeptide 
fragments, or for generating immunoassay components. Likewise, unique fragments can be 
employed to produce nonfused fragments of the cancer associated antigen polypeptides, 
useful, for example, in the preparation of antibodies, and in immunoassays. Unique fragments 

5 further can be used as antisense molecules to inhibit the expression of cancer associated 
antigen nucleic acids and polypeptides, particularly for therapeutic purposes as described in 
greater detail below. Unique fragments also can be used to create chimeric nucleic acid 
molecule or polypeptide molecules by, for example, joining all or part of the unique fragment 
to another nucleic acid or polypeptide molecule (homologous or not). For example, the 

10 unique fragment may be similar or identical in large part to a known molecule but may have a 
portion which is nonidentical to the known molecule; the known molecule and the unique 
fragment can be used to construct a molecule containing in large part the known molecule 
with the portion unique to the unique fragment added. Other chimeric molecules will be 
known to one of ordinary skill in the art and can be prepared using standard molecular biology 

15 techniques. 

As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon its conservancy in the genetic code. Thus, some regions of cancer associated 
antigen sequences and complements thereof will require longer segments to be unique while 
others will require only short segments, typically between 12 and 32 nucleotides (e.g. 12, 13, 

20 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 and 32 or more bases 

long), up to the entire length of the disclosed sequence. As mentioned above, this disclosure 
intends to embrace each and every fragment of each sequence, beginning at the first 
nucleotide* me second nucleotide and so on, up to 8 nucleotides short of the end, and ending 
anywhere from nucleotide number 8, 9, 10 and so on for each sequence, up to the very last 

25 nucleotide (provided the sequence is unique as described above). 

Virtually any segment of the polypeptide coding region of novel cancer associated 
antigen nucleic acids, or complements thereof, that is 25 or more nucleotides in length will be 
unique. Those skilled in the art are well versed in methods for selecting such sequences, 
typically on the basis of the ability of the unique fragment to selectively distinguish the 

30 sequence of interest from other sequences in the human genome of the fragment to those on 
known databases typically is all that is necessary, although in vitro confirmatory hybridization 
and sequencing analysis maybe performed. 
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Especially prefeiTed include nucleic acids encoding a series of epitopes, known as 
*'polytopes , \ The epitopes can be arranged in sequential or overlapping fashion (see, e.g., 
Thomson et al., Proc. Natl Acad, ScL USA 92:5845-5849, 1995;,Gilbert et al., Nature 
Biotechnol 15:1280-1284, 1997), with or without the natural flanking sequences, and can be 
separated by unrelated linker sequences if desired. The polytope is processed to generate 
individual epitopes which are recognized by the immune system for generation of immune 
responses. 

Thus, for example, peptides derived from a polypeptide having an amino acid 
sequence encoded by one of the nucleic acid disclosed herein, and which are presented by 
MHC molecules and recognized by CTL or T helper lymphocytes, can be combined with 
peptides from one or more other cancer associated antigens (e.g. by preparation of hybrid 
nucleic acids or polypeptides) to form 4t polytopes". The two or more peptides (or nucleic 
acids encoding the peptides) can be selected from those described herein, or they can include 
one or more peptides of previously known cancer associated antigens. Exemplary cancer 
associated peptide antigens that can be administered to induce or enhance an immune 
response are derived from tumor associated genes and encoded proteins including MAGE-A1, 
MAGE-A2, MAGE-A3, MAGE-A4, MAGE-A5, MAGE-A6, MAGE-A7, MAGE-A8, 
MAGE-A9, MAGE-A10, MAGE-A11, MAGE-A12, GAGE-1, GAGE-2, GAGE-3, GAGE-4, 
GAGE-5, GAGE-6, GAGE-7, GAGE-8, GAGE-9, BAGE-1, RAGE-1, LB33MUM-1, 
PRAME, NAG, MAGE-B2, MAGE-B3, MAGE-B4, tyrosinase, brain glycogen 
phosphorylase, Melan-A, MAGE-C1, MAGE-C2, MAGE-C3, MAGE-C4, MAGE-C5, NY- 
ESO-1, LAGE-1, SSX-1, SSX-2 (HOM-MEL-40), SSX-4, SSX-5, SCP-1 and CT-7. See, for 
example, PCT application publication no. WO96/10577. Other examples will be known to 
one of ordinary skill in the art (for example, see Coulie, Stem Cells 13:393-403, 1995), and 
can be used in the invention in a like manner as those disclosed herein. One of ordinary skill 
in the art can prepare polypeptides comprising one or more peptides and one or more of the 
foregoing cancer associated peptides, or nucleic acids encoding such polypeptides, according 
to standard procedures of molecular biology. 

Thus polytopes are groups of two or more potentially immunogenic or immune 
response stimulating peptides which can be joined together in various arrangements (e.g. 
concatenated, overlapping). The polytope (or nucleic acid encoding the polytope) can be 
administered in a standard immunization protocol, e.g. to animals, to test the effectiveness of 
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the polytope in stimulating, enhancing and/or provoking an immune response. 

The peptides can be joined together directly or via the use of flanking sequences to 
form polytopes, and the use of polytopes as vaccines is well known in the art (see, e.g., 
Thomson et al, Proc. Acad. Natl. Acad. Sci USA 92(13):5845-5849, 1995; Gilbert et al., 
Nature Biotechnol. 15(12):1280-1284, 1997; Thomson et al., /. Immunol. 157(2):822-826, 
1996; Tarn et al., J. Exp. Med. 171(l):299-306, 1990). For example, Tarn showed that 
polytopes consisting of both MHC class I and class H binding epitopes successfully generated 
antibody and protective immunity in a mouse model. Tarn also demonstrated that polytopes 
comprising "strings" of epitopes are processed to yield individual epitopes which are 
presented by MHC molecules and recognized by CTLs. Thus polytopes containing various 
numbers and combinations of epitopes can be prepared and tested for recognition by CTLs 
and for efficacy in increasing an immune response. 

It is known that tumors express a set of tumor antigens, of which only certain subsets 
maybeexpressedinthetumorofanygivenpatient. Polytopes can be prepared which , 
correspond to the different combination of epitopes representing the subset of tumor rejection 
antigens expressed in a particular patient. Polytopes also can be prepared to reflect a broader 
spectrum of tumor rejection antigens known to be expressed by a tumor type. Polytopes can 
be introduced to a patient in need of such treatment as polypeptide structures, or via the use of 
nucleic acid delivery systems known in the art (see, e.g., Allsopp et al., Eur. J. Immunol. 
26(8):1951-1959, 1996). Adenovirus, pox vims, Ty-virus like particles, adeno-associated 
virus, plasmids, bacteria, etc. can be used in such delivery. One can test the polytope delivery 
systems in mouse models to determine efficacy of the delivery system. The systems also can 

be tested in human clinical trials. 

In instances in which a human HLA class I molecule presents tumor rejection antigens 
derived from cancer associated nucleic acids, the expression vector may also include a nucleic 
acid sequence coding for the HLA molecule that presents any particular tumor rejection 
antigen derived from these nucleic acids and polypeptides. Alternatively, the nucleic acid 
sequence coding for such a HLA molecule can be contained within a separate expression 
vector. In a situation where the vector contains both coding sequences, the single vector can 
be used to transfect a cell which does not normally express either one. Where the coding 
sequences for a cancer associated antigen precursor and the HLA molecule which presents it 
are contained on separate expression vectors, the expression vectors can be cotransfected. 
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The cancer associated antigen precursor coding sequence may be used alone, when, e.g. the 
host cell already expresses a HLA molecule which presents a cancer associated antigen 
derived from precursor molecules. Of course, there is no limit on the particular host cell 
which can be used. As the vectors which contain the two coding sequences may be used in 
any antigen-presenting cells if desired, and the gene for cancer associated antigen precursor 
can be used in host cells which do not express a HLA molecule which presents a cancer 
associated antigen. Further, cell-free transcription systems maybe used in lieu of cells. 

As mentioned above, the invention embraces antisense oligonucleotides that 
selectively bind to a nucleic acid molecule encoding a cancer associated antigen polypeptide, 
to reduce the expression of cancer associated antigens. This is desirable in virtually any 
medical condition wherein a reduction of expression of cancer associated antigens is 
desirable, e.g., in the treatment of cancer. This is also useful for in vitro or in vivo testing of 
the effects of a reduction of expression of one or more cancer associated antigens. 

As used herein, the term "antisense oligonucleotide" or "antisense" describes an 
oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide which hybridizes under 
physiological conditions to DNA comprising a particular gene or to an mRNA transcript of 
that gene and, thereby, inhibits the transcription of that gene and/or the translation of that 
mRNA. The antisense molecules are designed so as to interfere with transcription or 
translation of a target gene upon hybridization with the target gene or transcript Those skilled 
in the art will recognize that the exact length of the antisense oligonucleotide and its degree of 
complementarity with its target will depend upon the specific target selected, including the 
sequence of the target and the particular bases which comprise that sequence. It is preferred 
that the antisense oligonucleotide be constructed and arranged so as to bind selectively with 
the target under physiological conditions, i.e., to hybridize substantially more to the target 
sequence than to any other sequence in the target cell under physiological conditions. Based 
upon the sequences of nucleic acids encoding breast, gastric or prostate cancer associated 
antigens, or upon allelic or homologous genomic and/or cDNA sequences, one of skill in the 
art can easily choose and synthesize any of a number of appropriate antisense molecules for 
use in accordance with the present invention. For example, a "gene walk" comprising a series 
of oligonucleotides of 15-30 nucleotides spanning the length of a cancer associated antigen 
can be prepared, followed by testing for inhibition of cancer associated antigen expression. 
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Optionally, gaps of 5-10 nucleotides can be left between the oligonucleotides to reduce the 
number of oligonucleotides synthesized and tested. 

In order to be sufficiently selective and potent for inhibition, such antisense 
oligonucleotides should comprise at least 10 and, more preferably, at least 15 consecutive 
bases which are complementary to the target, although in certain cases modified 
oligonucleotides as short as 7 bases in length have been used successfully as antisense 
oligonucleotides (Wagner et al., Nature Biotechnol 14:840-844, 1996). Most preferably, the 
antisense oligonucleotides comprise a complementary sequence of 20-30 bases. Although 
oligonucleotides may be chosen which are antisense to any region of the gene or mRNA 
transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal 
or 5' upstream sites such as translation initiation, transcription initiation or promoter sites. In 
addition, 3'-untranslated regions maybe targeted. Targeting to mRNA splicing sites has also 
been used in the art but may be less preferred if alternative mRNA splicing occurs. In 
addition, the antisense is targeted, preferably, to sites in which mRNA secondary structure is 
not expected (see, e.g., Sainio et al., CellMoL Neurobiol. 14(5):439-457, 1994) and at which 
proteins are not expected to bind. Finally, although the listed sequences are cDNA sequences, 
one of ordinary skill in the art may easily derive the genomic DNA corresponding to the 
cDNA of a cancer associated antigen. Thus, the present invention also provides for antisense 
oligonucleotides which are complementary to the genomic DNA corresponding to nucleic 
acids encoding cancer associated antigens. Similarly, antisense to allelic or homologous 
cDNAs and genomic DNAs are enabled without undue experimentation. 

In one set of embodiments, the antisense oligonucleotides of the invention may be 
composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. 
That is, the 5' end of one native nucleotide and the 3' end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester internucleoside linkage. These 
oligonucleotides may be prepared by art recognized methods which may be carried out 
manually or by an automated synthesizer. They also may be produced recombinantly by 
vectors. 

In preferred embodiments, however, the antisense oligonucleotides of the invention 
also may include "modified" oligonucleotides. That is, the oligonucleotides may be modified 
in a number of ways which do not prevent them from hybridizing to their target but which 
enhance their stability or targeting or which otherwise enhance their therapeutic effectiveness. 
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The term "modified oligonucleotide" as used herein describes an oligonucleotide in 
which (1) at least two of its nucleotides are covalently linked via a synthetic intemucleoside 
linkage (i.e., a linkage other than a phosphodiester linkage between the 5' end of one 
nucleotide and the 3' end of another nucleotide) and/or (2) a chemical group not normally 
associated with nucleic acids has been covalently attached to the oligonucleotide. Preferred 
synthetic intemucleoside linkages are phosphorothioates, alkylphosphonates, 
phosphorodithioates, phosphate esters, alkylphosphonothioates, phosphoramidates, 
carbamates, carbonates, phosphate triesters, acetamidates, carboxymethyl esters and peptides. 

The term "modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 
oligonucleotides having backbone sugars which are covalently attached to low molecular 
weight organic groups other than a hydroxyl group at the 3' position and other than a 
phosphate group at the 5' position. Thus modified oligonucleotides may include a 2'-0- 
alkylated ribose group. In addition, modified oligonucleotides may include sugars such as 
arabinose instead of ribose. The present invention, thus, contemplates pharmaceutical 
preparations containing modified antisense molecules that are complementary to and 
hybridizable with, under physiological conditions, nucleic acids encoding breast, gastric or 
prostate cancer associated antigen polypeptides, together with pharmaceutically acceptable 
20 carriers. 

Antisense oligonucleotides may be administered as part of a pharmaceutical 
composition. Such a pharmaceutical composition may include the antisense oligonucleotides 
in combination with any standard physiologically and/or pharmaceutically acceptable carriers 
which are known in the art. The compositions should be sterile and contain a therapeutically 
15 effective amount of the antisense oligonucleotides in a unit of weight or volume suitable for 
administration to a patient. The term "pharmaceutically acceptable" means a non-toxic 
material that does not interfere with the effectiveness of the biological activity of the active 
ingredients. The term "physiologically acceptable" refers to a non-toxic material that is 
compatible with a biological system such as a cell, cell culture, tissue, or organism. The 
0 characteristics of the carrier will depend on the route of administration. Physiologically and 
pharmaceutically acceptable carriers include diluents, fillers, salts, buffers, stabilizers, 
solubilizers, and other materials which are well known in the art, as further described below. 
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As used herein, a "vector" may be any of a number of nucleic acids into which a 
desired sequence may be inserted by restriction and ligation for transport between different 
genetic environments or for expression in a host cell. Vectors are typically composed of DNA 
although RNA vectors are also available. Vectors include, but are not limited to, plasmids, 
5 phagemids and virus genomes. A cloning vector is one which is able to replicate 

autonomously or integrated in the genome in a host cell, and which is further characterized by 
one or more endonuclease restriction sites at which the vector may be cut in a determinable 
fashion and into which a desired DNA sequence may be ligated such that the new 
recombinant vector retains its ability to replicate in the host cell. In the case of plasmids, 

10 replication of the desired sequence may occur many times as the plasmid increases in copy 
number within the host bacterium or just a single time per host before the host reproduces by 
mitosis. In the case of phage, replication may occur actively during a lytic phase or passively 
during a lysogenic phase. An expression vector is one into which a desired DNA sequence 
may be inserted by restriction and ligation such that it is operably joined to regulatory 

15 sequences and may be expressed as an RNA transcript. Vectors may further contain one or 
more marker sequences suitable for use in the identification of cells which have or have not 
been transformed or transfected with the vector. Markers include, for example, genes 
encoding proteins which increase or decrease either resistance or sensitivity to antibiotics or 
other compounds, genes which encode enzymes whose activities are detectable by standard 

20 assays known in the art (e.g., p-galactosidase, luciferase or alkaline phosphatase), and genes 
which visibly affect the phenotype of transformed or transfected cells, hosts, colonies or 
plaques (e.g., green fluorescent protein). Preferred vectors are those capable of autonomous 
replication and expression of the. structural gene products present in the DNA segments to 
which they are operably joined. 

25 As used herein, a coding sequence and regulatory sequences are said to be "operably" 

joined when they are covalently linked in such a way as to place the expression or 
transcription of the coding sequence under the influence or control of the regulatory 
sequences. If it is desired that the coding sequences be translated into a functional protein, 
two DNA sequences are said to be operably joined if induction of a promoter in the 5' 

30 regulatory sequences results in the transcription of the coding sequence and if the nature of the 
linkage between the two DNA sequences does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the promoter region to direct the transcription 
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of the coding sequences, or (3) interfere with the ability of the corresponding RNA transcript 
to be translated into a protein. Thus, a promoter region would be operably joined to a coding 
sequence if the promoter region were capable of effecting transcription of that DNA sequence 
such that the resulting transcript might be translated into the desired protein or polypeptide. 
5 The precise nature of the regulatory sequences needed for gene expression may vary 

between species or cell types, but shall in general include, as necessary, 5' non-transcribed 
and 5* non-translated sequences involved with the initiation of transcription and translation 
respectively, such as a TATA box, capping sequence, CAAT sequence, and the like. 
Especially, such 5* non-transcribed regulatory sequences will include a promoter region 
10 which includes a promoter sequence for transcriptional control of the operably joined gene. 
Regulatory sequences may also include enhancer sequences or upstream activator sequences 
as desired The vectors of the invention may optionally include 5' leader or signal sequences. 
The choice and design of an appropriate vector is within the ability and discretion of one of 
ordinary skill in the art. 
15 Expression vectors containing all the necessary elements for expression are 

commercially available and known to those skilled in the art. See, e.g., Sambrook et al., 
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory 
Press, 1989. Cells are genetically engineered by the introduction into the cells of heterologous 
DNA (RNA) encoding a cancer associated antigen polypeptide or fragment or variant thereof. 
20 That heterologous DNA (RNA) is placed under operable control of transcriptional elements 
to permit the expression of the heterologous DNA in the host cell. 

Preferred systems for mRNA expression in mammalian cells are those such as 
pRc/CMV (available from Invitrogen, Carlsbad, CA) that contain a selectable marker such as 
a gene that confers G418 resistance (which facilitates the selection of stably transfected cell 
25 lines) and the human cytomegalovirus (CMV) enhancer-promoter sequences. Additionally, 
suitable for expression in primate or canine cell lines is the pCEP4 vector (Invitrogen), which 
contains an Epstein Barr Virus (EB V) origin of replication, facilitating the maintenance of 
plasmid as a multicopy extrachromosomal element Another expression vector is the pEF- 
BOS plasmid containing the promoter of polypeptide Elongation Factor la, which stimulates 
30 efficiently transcription in vitro. The plasmid is described by Mishizuma and Nagata (Nuc. 
Acids Res. 18:5322, 1990), and its use in transfection experiments is disclosed by, for 
example, Demoulin (MoL Cell. Biol 16:4710-4716, 1996). Still another preferred expression 
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vector is an adenovirus, described by Stratford-Perricaudet, which is defective for El and E3 
proteins (J. Clin. Invest. 90:626-630, 1992). The use of the adenovirus as an Adeno.Pl A 
recombinant for the expression of an antigen is disclosed by Warnier et al., in intradermal 
injection in mice for immunization against PI A (Int. J. Cancer> 67:303-310, 1996). 
Additional vectors for delivery of nucleic acid are provided below. 

The invention also embraces so-called expression kits, which allow the artisan to 
prepare a desired expression vector or vectors. Such expression kits include at least separate 
portions of a vector and one or more of the previously discussed cancer associated antigen 
nucleic acid molecules. Other components may be added, as desired, as long as the previously 
mentioned nucleic acid molecules, which are required, are included. The invention also 
includes kits for amplification of a cancer associated antigen nucleic acid, including at least 
one pair of amplification primers which hybridize to a cancer associated antigen nucleic acid. 
The primers preferably are 12-32 nucleotides in length and are non-overlapping to prevent 
formation of "primer-dimers". One of the primers will hybridize to one strand of the cancer 
associated antigen nucleic acid and the second primer will hybridize to the complementary 
strand of the cancer associated antigen nucleic acid, in an arrangement which permits 
amplification of the cancer associated antigen nucleic acid. Selection of appropriate primer 
pairs is standard in the art. For example, the selection can be made with assistance of a 
computer program designed for such a purpose, optionally followed by testing the primers for 
amplification specificity and efficiency. 

The invention also permits the construction of cancer associated antigen gene 4i knock- 
outs" and transgenic overexpression in cells and in animals, providing materials for studying 
certain aspects of cancer and immune system responses to cancer. 

The invention also provides isolated polypeptides (including whole proteins and 
partial proteins) encoded by the foregoing cancer associated antigen nucleic acids. Such 
polypeptides are useful, for example, alone or as fusion proteins to generate antibodies, as 
components of an immunoassay or diagnostic assay or as therapeutics. Cancer associated 
antigen polypeptides can be isolated from biological samples including tissue or cell 
homogenates, and can also be expressed recombinantly in a variety of prokaryotic and 
eukaryotic expression systems by constructing an expression vector appropriate to the 
expression system, introducing the expression vector into the expression system, and isolating 
the recombinantly expressed protein. Short polypeptides, including antigenic peptides (such 
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as are presented by MHC molecules on the surface of a cell for immune recognition) also can 
be synthesized chemically using well-established methods of peptide synthesis. 

A unique fragment of a cancer associated antigen polypeptide, in general, has the 
features and characteristics of unique fragments as discussed above in connection with nucleic 
5 acids. As will be recognized by those skilled in the art, the size of the unique fragment will 
depend upon factors such as whether the fragment constitutes a portion of a conserved protein 
domain. Thus, some regions of cancer associated antigens will require longer segments to be 
unique while others will require only short segments, typically between 5 and 12 amino acids 
(e.g. 5, 6, 7, 8, 9, 10, 1 1 or 12 or more amino acids including each integer up to the full 
10 length). 

Unique fragments of a polypeptide preferably are those fragments which retain a 
distinct functional capability of the polypeptide. Functional capabilities which can be retained 
in a unique fragment of a polypeptide include interaction with antibodies, interaction with 
other polypeptides or fragments thereof, selective binding of nucleic acids or proteins, and 
15 enzymatic activity. One important activity is the ability to act as a signature for identifying 
the polypeptide. Another is the ability to complex with HLA and to provoke in a human an 
immune response. Those skilled in the art are well versed in methods for selecting unique 
amino acid sequences, typically on the basis of the ability of the unique fragment to 
selectively distinguish the sequence of interest from non-family members. A comparison of 
20 the sequence of the fragment to those on known databases typically is all that is necessary. 
The invention embraces variants of the cancer associated antigen polypeptides 
described above. As used herein, a **variant" of a cancer associated antigen polypeptide is a 
polypeptide which contains one or more modifications to the primary amino acid sequence of 
a cancer associated antigen polypeptide. Modifications which create a cancer associated 
25 antigen variant can be made to a cancer associated antigen polypeptide 1) to reduce or 

eliminate an activity of a cancer associated antigen polypeptide; 2) to enhance a property of a 
cancer associated antigen polypeptide, such as protein stability in an expression system or the 
stability of protein-protein binding; 3) to provide a novel activity or property to a cancer 
associated antigen polypeptide, such as addition of an antigenic epitope or addition of a 
30 detectable moiety; or 4) to provide equivalent or better binding to an HLA molecule. 

Modifications to a cancer associated antigen polypeptide are typically made to the nucleic acid 
which encodes the cancer associated antigen polypeptide, and can include deletions, point 
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mutations, truncations, amino acid substitutions and additions of amino acids or non-amino 
acid moieties. Alternatively, modifications can be made directly to the polypeptide, such as 
by cleavage, addition of a linker molecule, addition of a detectable moiety, such as biotin, 
addition of a fatty acid, substitution of L-amino acids with D-amino acids, and the like. 

5 Modifications also embrace fusion proteins comprising all or part of the cancer associated 
antigen amino acid sequence. One of skill in the art will be familiar with methods for 
predicting the effect on protein conformation of a change in protein sequence, and can thus 
"design" a variant cancer associated antigen polypeptide according to known methods. One 
example of such a method is described by Dahiyat and Mayo in Science 278:82-87, 1997, 

10 whereby proteins can be designed de novo. The method can be applied to a known protein to 
vary a only a portion of the polypeptide sequence. By applying the computational methods of 
Dahiyat and Mayo, specific variants of a cancer associated antigen polypeptide can be 
proposed and tested to determine whether the variant retains a desired conformation. Other 
computational and computer modeling methods for designing polypeptide mimetics which 

15 retain activity of the polypeptides described herein, as well as selection methods such as phage 
display of peptide libraries are known in the art. 

In general, variants include cancer associated antigen polypeptides which are modified 
specifically to alter a feature of the polypeptide unrelated to its desired physiological activity. 
For example, cysteine residues can be substituted or deleted to prevent unwanted disulfide 

20 linkages. Similarly, certain amino acids can be changed to enhance expression of a cancer 
associated antigen polypeptide by eliminating proteolysis by proteases in an expression system 
(e.g., dibasic amino acid residues in yeast expression systems in which KEX2 protease 
activity i& present). 

Mutations of a nucleic acid which encode a cancer associated antigen polypeptide 
25 preferably preserve the amino acid reading frame of the coding sequence, and preferably do 
not create regions in the nucleic acid which are likely to hybridize to form secondary 
structures, such a hairpins or loops, which can be deleterious to expression of the variant 
polypeptide. 

Mutations can be made by selecting an amino acid substitution, or by random 
30 mutagenesis of a selected site in a nucleic acid which encodes the polypeptide. Variant 
polypeptides are then expressed and tested for one or more activities to determine which 
mutation provides a variant polypeptide with the desired properties. Further mutations can be 
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made to variants (or to non-variant cancer associated antigen polypeptides) which are silent as 
to the amino acid sequence of the polypeptide, but which provide preferred codons for 
translation in a particular host The preferred codons for translation of a nucleic acid in, e.g., 
E. coli, are well known to those of ordinary skill in the art. Still other mutations can be made 
to the noncoding sequences of a cancer associated antigen gene or cDNA clone to enhance 
expression of the polypeptide. The activity of variants of cancer associated antigen 
polypeptides can be tested by cloning the gene encoding the variant cancer associated antigen 
polypeptide into a bacterial or mammalian expression vector, introducing the vector into an 
appropriate host cell, expressing the variant cancer associated antigen polypeptide, and testing 
for a functional capability of the cancer associated antigen polypeptides as disclosed herein. 
For example, the variant cancer associated antigen polypeptide can be tested for reaction with 
autologous or allogeneic sera as disclosed in the Examples. Preparation of other variant 
polypeptides may favor testing of other activities, as will be known to one of ordinary skill in 
the art 

The skilled artisan will also realize that conservative amino acid substitutions may be 
made in cancer associated antigen polypeptides to provide functionally equivalent variants of 
the foregoing polypeptides, i.e, the variants retain the functional capabilities of the cancer 
associated antigen polypeptides. As used herein, a "conservative amino acid substitution" 
refers to an amino acid substitution which does not alter the relative charge or size 
characteristics of the protein in which the amino acid substitution is made. Variants can be 
prepared according to methods for altering polypeptide sequence known to one of ordinary 
skill in the art such as are found in references which compile such methods, e.g. Molecular 
Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York, 1989, or Current Protocols in Molecular 
Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, Inc., New York. Exemplary 
functionally equivalent variants of the cancer associated antigen polypeptides include 
conservative amino acid substitutions of in the amino acid sequences of proteins disclosed 
herein. Conservative substitutions of amino acids include substitutions made amongst amino 
acids within the following groups: (a) M; I, L.V; (b) F, Y, W; (c) K, R, H; (d) A, G; (e) STT;' 
(f)Q,N;and(g)E,D. 

For example, upon determining that a peptide derived from a cancer associated antigen 
polypeptide is presented by an MHC molecule and recognized by CTLs, one can make 
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conservative amino acid substitutions to the amino acid sequence of the peptide, particularly 
at residues which are thought not to be direct contact points with the MHC molecule. For 
example, methods for identifying functional variants of HLA class II binding peptides are 
provided in a published PCT application of Strominger and Wucherpfennig 
(PCT/US96/03182). Peptides bearing one or more amino acid substitutions also can be tested 
for concordance with known HLA/MHC motifs prior to synthesis using, e.g. the computer 
program described by D'Amaro and Drijfhout (D'Amaro et al., Human Immunol 43:13-18, 
1995; Drijfhout et al., Human Immunol. 43:1-12, 1995). The substituted peptides can then be 
tested for binding to the MHC molecule and recognition by CTLs when bound to MHC. 
These variants can be tested for improved stability and are useful, inter alia, in vaccine 
compositions. 

Conservative amino-acid substitutions in the amino acid sequence of cancer associated 
antigen polypeptides to produce functionally equivalent variants of cancer associated antigen 
polypeptides typically are made by alteration of a nucleic acid encoding a cancer associated 
antigen polypeptide. Such substitutions can be made by a variety of methods known to one of 
ordinary skill in the art. For example, amino acid substitutions may be made by PCR-directed 
mutation, site-directed mutagenesis according to the method of Kunkel (Kunkel, Proa Nat. 
Acad. Sci. U.SA. 82: 488-492, 1985), or by chemical synthesis of a gene encoding a cancer 
associated antigen polypeptide. Where amino acid substitutions are made to a small unique 
fragment of a cancer associated antigen polypeptide, such as an antigenic epitope recognized 
by autologous or allogeneic sera or cytolytic T lymphocytes, the substitutions can be made by 
directly synthesizing the peptide. The activity of functionally equivalent fragments of cancer 
associated antigen polypeptides can be tested by cloning the gene encoding the altered cancer 
associated antigen polypeptide into a bacterial or mammalian expression vector, introducing 
the vector into an appropriate host cell, expressing the altered cancer associated antigen 
polypeptide, and testing for a functional capability of the cancer associated antigen 
polypeptides as disclosed herein. Peptides which are chemically synthesized can be tested 
directly for function, e.g., for binding to antisera recognizing associated antigens. 

The invention as described herein has a number of uses, some of which are described 
elsewhere herein. First, the invention permits production and/or isolation of the cancer 
associated antigen protein molecules. A variety of methodologies well-known to the skilled 
practitioner can be utilized to obtain isolated cancer associated antigen molecules. The 
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polypeptide may be purified from cells which naturally produce the polypeptide by 
chromatographic means or immunological recognition. Alternatively, an expression vector 
may be introduced into cells to cause production of the polypeptide. In another method, 
mRNA transcripts may be microinjected or otherwise introduced into cells to cause 
5 production of the encoded polypeptide. Translation of mRNA in cell-free extracts such as the 
reticulocyte lysate system also may be used to produce polypeptide. Those skilled in the art 
also can readily follow known methods for isolating cancer associated antigen polypeptides. 
These include, but are not limited to, immunochromatography, HPLC, size-exclusion 
chromatography, ion-exchange chromatography and immune-affinity chromatography. 
3 The isolation and identification of cancer associated antigen genes also makes it 

possible for the artisan to diagnose a disorder characterized by expression of cancer associated 
antigens. These methods involve determining expression of one or more cancer associated 
antigen nucleic acids, and/or encoded cancer associated antigen polypeptides and/or peptides 
derived therefrom. In the former situation, such determinations can be carried out via any 
standard nucleic acid determination assay, including the polymerase chain reaction, or 
assaying with labeled hybridization probes. In the latter situation, such determinations can be 
carried out by screening patient antisera for recognition of the polypeptide. 

The invention also makes it possible isolate proteins which bind to cancer associated 
antigens as disclosed herein, including antibodies and cellular binding partners of the cancer 
associated antigens. Additional uses are described further herein. 

The invention also provides, in certain embodiments, "dominant negative" 
polypeptides derived from cancer associated antigen polypeptides. A dominant negative 
polypeptide is an inactive variant of a protein, which, by interacting with the cellular 
machinery, displaces an active protein from its interaction with the cellular machinery or 
competes with the active protein, thereby reducing the effect of the active protein. For 
example, a dominant negative receptor which binds a ligand but does not transmit a signal in 
response to binding of the ligand can reduce the biological effect of expression of the ligand. 
Likewise, a dominant negative catalytically-inactive kinase which interacts normally with 
target proteins but does not phosphorylate the target proteins can reduce phosphorylation of 
the target proteins in response to a cellular signal. Similarly, a dominant negative 
transcription factor which binds to a promoter site in the control region of a gene but does not 
increase gene transcription can reduce the effect of a normal transcription factor by occupying 
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promoter binding sites without increasing transcription. 

The end result of the expression of a dominant negative polypeptide in a cell is a 
reduction in function of active proteins. One of ordinary skill in the art can assess the 
potential for a dominant negative variant of a protein, and using standard mutagenesis 
techniques to create one or more dominant negative variant polypeptides. For example, given 
the teachings contained herein of breast, gastric and prostate cancer associated antigens, 
especially those which are similar to known proteins which have known activities, one of 
ordinary skill in the art can modify the sequence of the cancer associated antigens by site- 
specific mutagenesis, scanning mutagenesis, partial gene deletion or truncation, and the like. 
See, e.g., U.S. Patent No. 5,580,723 and Sambrook et al., Molecular Cloning: A Laboratory 
Mm«<z/,Second Edition, Cold SpringHarbor Laboratory Press, 1989. The skilled artisan then 
can test the population of mutagenized polypeptides for diminution in a selected and/or for 
retention of such an activity. Other similar methods for creating and testing dominant 
negative variants of a protein will be apparent to one of ordinary skill in the art. 

The invention also involves agents such as polypeptides which bind to cancer 
associated antigen polypeptides. Such binding agents can be used, for example, in screening 
assays to detect the presence or absence of cancer associated antigen polypeptides and 
complexes of cancer associated antigen polypeptides and their binding partners and in 
purification protocols to isolated cancer associated antigen polypeptides and complexes of 
cancer associated antigen polypeptides and their binding partners. Such agents also can be 
used to inhibit the native activity of the cancer associated antigen polypeptides, for example, 
by binding to such polypeptides. 

The invention, therefore, embraces peptide binding agents which, for example, can be 
antibodies or fragments of antibodies having the ability to selectively bind to cancer 
associated antigen polypeptides. Antibodies include polyclonal and monoclonal antibodies, 
prepared according to conventional methodology. 

Significantly, as is well-known in the art, only a small portion of an antibody 
molecule, the paratope, is involved in the binding of the antibody to its epitope (see, in 
general, Clark, W.R. (1986) The Experimental Foundations of Modern Immunology. Wiley & 
Sons, Inc., New York; Roitt, I. (1991) Essential Immunology, 7th Ed., Blackwell Scientific 
Publications, Oxford). The pFC and Fc regions, for example, are effectors of the complement 
cascade but are not involved in antigen binding. An antibody from which the pFc' region has 
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been enzymatically cleaved, or which has been produced without the pFc' region, designated 
an F(ab')2 fragment, retains both of the antigen binding sites of an intact antibody. Similarly, 
an antibody from which the Fc region has been enzymatically cleaved, or which has been 
produced without the Fc region, designated an Fab fragment, retains one of the antigen 
5 binding sites of an intact antibody molecule. Proceeding further, Fab fragments consist of a 
covalently bound antibody light chain and a portion of the antibody heavy chain denoted Fd. 
The Fd fragments are the major determinant of antibody specificity (a single Fd fragment may 
be associated with up to ten different light chains without altering antibody specificity) and Fd 
fragments retain epitope-binding ability in isolation. 
> Within the antigen-binding portion of an antibody, as is well-known in the art, there 

are complementarity determining regions (CDRs), which directly interact with the epitope of 
the antigen, and framework regions (FRs), which maintain the tertiary structure of the 
paratope (see, in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and 
the light chain of IgG immunoglobulins, there are four framework regions (FR1 through FR4) 
separated respectively by three complementarity determining regions (CDR1 through CDR3). 
The CDRs, and in particular the CDR3 regions, and more particularly the heavy chain CDR3, 
are largely responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian 
antibody may be replaced with similar regions of conspecific or heterospecific antibodies 
while retaining the epitopic specificity of the original antibody. This is most clearly 
manifested in the development and use of "humanized" antibodies in which non-human CDRs 
are covalently joined to human FR and/or Fc/pFC regions to produce a functional antibody. 
See, e.g., U.S. patents 4,816,567, 5,225,539, 5,585,089, 5,693,762 and 5,859,205. 

Thus, for example, PCT International Publication Number WO 92/04381 teaches the 
production and use of humanized murine RSV antibodies in which at least a portion of the 
murine FR regions have been replaced by FR regions of human origin. Such antibodies, 
including fragments of intact antibodies with antigen-binding ability, are often referred to as 
"chimeric" antibodies. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 
provides for F(ab') 2 , Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or FR 
and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric F(ab') 2 fragment antibodies in which 
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the FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric Fab fragment antibodies in which the 
FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; and chimeric Fd fragment antibodies in which 
the FR and/or CDR1 and/or CDR2 regions have been replaced by homologous human or non- 
human sequences. The present invention also includes so-called single chain antibodies. 

Thus, the invention involves polypeptides of numerous size and type that bind 
specifically to cancer associated antigen polypeptides, and complexes of both cancer 
associated antigen polypeptides and their binding partners. These polypeptides may be 
derived also from sources other than antibody technology. For example, such polypeptide 
binding agents can be provided by degenerate peptide libraries which can be readily prepared 
in solution, in immobilized form or as phage display libraries. Combinatorial libraries also 
can be synthesized of peptides containing one or more amino acids. Libraries further can be 
synthesized of peptoids and non-peptide synthetic moieties. 

Phage display can be particularly effective in identifying binding peptides useful 
according to the invention. Briefly, one prepares a phage library (using e.g. ml3, fd, or 
lambda phage), displaying inserts from 4 to about 80 amino acid residues using conventional 
procedures. The inserts may represent, for example, a completely degenerate or biased array. 
One then can select phage-bearing inserts which bind to the cancer associated antigen 
polypeptide. This process can be repeated through several cycles of reselection of phage that 
bind to the cancer associated antigen polypeptide. Repeated rounds lead to enrichment of 
phage bearing particular sequences. DNA sequence analysis can be conducted to identify the 
sequences of the expressed polypeptides. The minimal linear portion of the sequence that 
binds to the cancer associated antigen polypeptide can be determined. One can repeat the 
procedure using a biased library containing inserts containing part or all of the minimal linear 
portion plus one or more additional degenerate residues upstream or downstream thereof. 
Yeast two-hybrid screening methods also may be used to identify polypeptides that bind to the 
cancer associated antigen polypeptides. Thus, the cancer associated antigen polypeptides of 
the invention, or a fragment thereof, can be used to screen peptide libraries, including phage 
display libraries, to identify and select peptide binding partners of the cancer associated 
antigen polypeptides of the invention. Such molecules can be used, as described, for 
screening assays, for purification protocols, for interfering directly with the functioning of 
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cancer associated antigen and for other purposes that will be apparent to those of ordinary 
skill in the art. 

As detailed herein, the foregoing antibodies and other binding molecules may be used 
for example to identify tissues expressing protein or to purify protein. Antibodies also may be 
5 coupled to specific diagnostic labeling agents for imaging of cells and tissues that express 
cancer associated antigens or to therapeutically useful agents according to standard coupling 
procedures. Diagnostic agents include, but are not limited to, barium sulfate, iocetamic acid, 
iopanoic acid, ipodate calcium, diatrizoate sodium, diatrizoate meglumine, metrizamide, 
tyropanoate sodium and radiodiagnostics including positron emitters such as fiuorine-1 8 and 
10 carbon-1 1 , gamma emitters such as iodine-123, technetium-99m, iodine-1 3 1 and indium-1 1 1 , 
and nuclides for nuclear magnetic resonance such as fluorine and gadolinium. Other 
diagnostic agents useful in the invention will be apparent to one of ordinary skill in the art. 
As used herein, "therapeutically useful agents'* include any therapeutic molecule which 
desirably is targeted selectively to a cell expressing one of the cancer antigens disclosed 
15 herein, including antineoplastic agents, radioiodinated compounds, toxins, other cytostatic or 
cytolytic drugs, and so forth. Antineoplastic therapeutics are well known and include: 
aminoglutethimide, azathioprine, bleomycin sulfate, busulfan, cannustine, chlorambucil, 
cisplatin, cyclophosphamide, cyclosporine, cytarabidine, dacarbazine, dactinomycin, 
daunorubicin, doxorubicin, taxol, etoposide, fluorouracil, interferon-a, lomustine, 
mercaptopurine, methotrexate, mitotane, procarbazine HC1, thioguanine, vinblastine sulfate 
and vincristine sulfate. Additional antineoplastic agents include those disclosed in Chapter 
52, Antineoplastic Agents (Paul Calabresi and Bruce A. Chabner), and the introduction 
thereto, 1202-1263, of Goodman and Gilman's "The Pharmacological Basis of Therapeutics", 
Eighth Edition, 1990, McGraw-Hill, Inc. (Health Professions Division). Toxins can be 
25 proteins such as, for example, pokeweed anti-viral protein, cholera toxin, pertussis toxin, 
ricin, gelonin, abrin, diphtheria exotoxin, or Pseudomonas exotoxin. Toxin moieties can also 
be high energy-emitting radionuclides such as cobalt-60. 

In the foregoing methods and compositions, antibodies prepared according to the 
invention also preferably are specific for the cancer associated antigen/MHC complexes 
30 described herein. 

When "disorder" is used herein, it refers to any pathological condition where the 
cancer associated antigens are expressed. An example of such a disorder is cancer, including 
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breast, gastric and prostate cancer as particular examples. 

Samples of tissue and/or cells for use in the various methods described herein can be 
obtained through standard methods such as tissue biopsy, including punch biopsy and cell 
scraping, and collection of blood or other bodily fluids by aspiration or other methods. 

5 In certain embodiments of the invention, an immunoreactive cell sample is removed 

from a subject. By "immunoreactive cell" is meant a cell which can mature into an immune 
cell (such as a B cell, a helper T cell, or a cytolytic T cell) upon appropriate stimulation. Thus 
immunoreactive cells include CD34 + hematopoietic stem cells, immature T cells and 
immature B cells. When it is desired to produce cytolytic T cells which recognize a cancer 

10 associated antigen, the immunoreactive cell is contacted with a cell which expresses a cancer 
associated antigen under conditions favoring production, differentiation and/or selection of 
cytolytic T cells; the differentiation of the T cell precursor into a cytolytic T cell upon 
exposure to antigen is similar to clonal selection of the immune system. 

Some therapeutic approaches based upon the disclosure are premised on a response by 

15 a subject's immune system, leading to lysis of antigen presenting cells, such as breast, gastric 
or prostate cancer cells which present one or more cancer associated antigens. One such 
approach is the administration of autologous CTLs specific to a cancer associated 
antigen/MHC complex to a subject with abnormal cells of the phenotype at issue. It is within 
the ability of one of ordinary skill in the art to develop such CTLs in vitro. An example of a 

20 method for T cell differentiation is presented in International Application number 

PCT/US96/05607. Generally, a sample of cells taken from a subject, such as blood cells, are 
contacted with a cell presenting the complex and capable of provoking CTLs to proliferate. 
The target cell can be a transfectant, such as a COS cell. These transfectants present the 
desired complex at their surface and, when combined with a CTL of interest, stimulate its 

25 proliferation. COS cells are widely available, as are other suitable host cells. Specific 

production of CTL clones is well known in the art The clonally expanded autologous CTLs 
then are administered to the subject. 

CTL proliferation can be increased by increasing the level of tryptophan in T cell 
cultures, by inhibiting enzymes which catabolize tryptophan, such as indoleamine 2,3- 

30 dioxygenase (IDO), or by adding tryptophan to the culture. Proliferation of T cells is 

enhanced by increasing the rate of proliferation and/or extending the number of divisions of 
the T cells in culture. In addition, increasing tryptophan in T cell cultures also enhances the 
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lytic activity of the T cells grown in culture. 

Another method for selecting antigen-specific CTL clones has recently been described 
(Altaian et al., Science 274:94-96, 1996; Dunbar et al., Curr. Biol. 8:413-416, 1998), in which 
fluorogenic tetramers of MHC class I molecule/peptide complexes are used to detect' specific 
5 CTL clones. Briefly, soluble MHC class I molecules are folded in vitro in the presence of fc- 
microglobulin and a peptide antigen which binds the class I molecule. After purification,, the 
MHC/peptide complex is purified and labeled with biotin. Tetramers are formed by mixing 
the biotinylated peptide-MHC complex with labeled avidin (e.g. phycoerythrin) at a molar 
ratio or 4:1. Tetramers are then contacted with a source of CTLs such as peripheral blood or 
3 lymph node. The tetramers bind CTLs which recognize the peptide antigen/MHC class I 
complex. Cells bound by the tetramers can be sorted by fluorescence activated cell sorting to 
isolate the reactive CTLs. The isolated CTLs then can be expanded in vitro for use as 
described herein. 

To detail a therapeutic methodology, referred to as adoptive transfer (Greenberg, J. 
Immunol. 136(5): 1917, 1986; Riddel et al., Science 257: 238, 1992; Lynch et al, Eur. J. 
Immunol. 21: 1403-1410,1991; Kast et al., Ce//59: 603-614, 1989), cells presenting the 
desired complex (e.g., dendritic cells) are combined with CTLs leading to proliferation of the 
CTLs specific thereto. The proliferated CTLs are then administered to a subject with a 
cellular abnormality which is characterized by certain of the abnormal cells presenting the- 
particular complex. The CTLs then lyse the abnormal cells, thereby achieving the desired 
therapeutic goal. 

The foregoing therapy assumes that at least some of the subject's abnormal cells 
present the relevant HLA/cancer associated antigen complex. This can be determined very 
easily, as the art is very familiar with methods for identifying cells which present a particular 
HLA molecule, as well as how to identify cells expressing DNA of the pertinent sequences, in 
this case a cancer associated antigen sequence. Once cells presenting the relevant complex are 
identified via the foregoing screening methodology, they can be combined with a sample from 
a patient, where the sample contains CTLs. If the complex presenting cells are lysed by the 
mixed CTL sample, then it can be assumed that a cancer associated antigen is being presented, 
and the subject is an appropriate candidate for the therapeutic approaches set forth supra. 

Adoptive transfer is not the only form of therapy that is available in accordance with 
the invention. CTLs can also be provoked in vivo, using a number of approaches. One 
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approach is the use of non-proliferative cells expressing the complex. The cells used in this 
approach may be those that normally express the complex, such as irradiated tumor cells or 
cells transfected with one or both of the genes necessary for presentation of the complex (i.e. 
the antigenic peptide and the presenting HLA molecule). Chen et al. (Proc. Natl. Acad. Sci. 

5 USA 88: 1 10-1 14,1991) exemplifies this approach, showing the use of transfected cells 
expressing HPV-E7 peptides in a therapeutic regime. Various cell types may be used. 
Similarly, vectors carrying one or both of the genes of interest may be used. Viral or bacterial 
vectors are especially preferred. For example, nucleic acids which encode a cancer associated 
antigen polypeptide or peptide may be operably linked to promoter and enhancer sequences 

10 which direct expression of the cancer associated antigen polypeptide or peptide in certain 
tissues or cell types. The nucleic acid may be incorporated into an expression vector. 
Expression vectors may be unmodified extrachromosomal nucleic acids, plasmids or viral 
genomes constructed or modified to enable insertion of exogenous nucleic acids, such as those 
encoding cancer associated antigens, as described elsewhere herein. Nucleic acids encoding 

15 one or more cancer associated antigens also may be inserted into a retroviral genome, thereby 
facilitating integration of the nucleic acid into the genome of the target tissue or cell type. In 
these systems, the gene of interest is carried by a microorganism, e.g., a Vaccinia virus, pox 
virus, herpes simplex virus, retrovirus or adenovirus, and the materials de facto "infect" host 
cells. The cells which result present the complex of interest, and are recognized by 

20 autologous CTLs, which then proliferate. 

A similar effect can be achieved by combining the cancer associated antigen or a 
stimulatory fragment thereof with an adjuvant to facilitate incorporation into antigen 
presenting cells in vivo. The cancer associated antigen polypeptide is processed to yield the 
peptide partner of the HLA molecule while a cancer associated antigen peptide may be 

25 presented without the need for further processing. Generally, subjects can receive an 

intradermal injection of an effective amount of the cancer associated antigen. Initial doses can 
be followed by booster doses, following immunization protocols standard in the art. Preferred 
cancer associated antigens include those found to react with allogeneic cancer antisera, shown 
in the examples below. 

30 The invention involves the use of various materials disclosed herein to "immunize" 

subjects or as "vaccines". As used herein, "immunization" or "vaccination" means increasing 
or activating an immune response against an antigen. It does not require elimination or 
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eradication of a condition but rather contemplates the clinically favorable enhancement of an 
immune response toward an antigen. Generally accepted animal models can be used for 
testing of immunization against cancer using a cancer associated antigen nucleic acid. For 
example, human cancer cells can be introduced into a mouse to create a tumor, and one or 
more cancer associated antigen nucleic acids can be delivered by the methods described 
herein. The effect on the cancer cells (e.g., reduction of tumor size) can be assessed as a 
measure of the effectiveness of the cancer associated antigen nucleic acid immunization. Of 
course, testing of the foregoing animal model using more conventional methods for 
immunization can include the administration of one or more cancer associated antigen 
polypeptides or peptides derived therefrom, optionally combined with one or more adjuvants 
and/or cytokines to boost the immune response. Methods for immunization, including 
formulation of a vaccine composition and selection of doses, route of administration and the 
schedule of administration (e.g. primary and one or more booster doses), are well known in 
the art The tests also can be performed in humans, where the end point is to test for the 
presence of enhanced levels of circulating CTLs against cells bearing the antigen, to test for 
levels of circulating antibodies against the antigen, to test for the presence of cells expressing 
the antigen and so forth. 

As part of the immunization compositions, one or more cancer associated antigens or 
stimulatory fragments thereof are administered with one or more adjuvants to induce an 
immune response or to increase an immune response. An adjuvant is a substance incorporated 
into or administered with antigen which potentiates the immune response. Adjuvants may 
enhance the immunological response by providing a reservoir of antigen (extracellularly or 
within macrophages), activating macrophages and stimulating specific sets of lymphocytes. 
Adjuvants of many kinds are well known in the art Specific examples of adjuvants include 
monophosphoryl lipid A (MPL, SmithKline Beecham), a congener obtained after purification 
and acid hydrolysis of Salmonella Minnesota Re 595 hpopolysaccharide; saponins including 
QS21 (SmithKline Beecham), a pure QA-21 saponin purified from Quillja saponaria extract; 
DQS21, described in PCT application W096/33739 (SmithKline Beecham); QS-7, QS-17, 
QS-18, and QS-L1 (So et al., Mol Cells 7:178-186; 1997); incomplete Freund's adjuvant; 
30 complete Freund's adjuvant; montanide; alum; CpG oligonucleotides (see e.g. Kreig et al., 
Nature 374:546-9, 1995); and various water-in-oil emulsions prepared from biodegradable 
oils such as squalene and/or tocopherol. Preferably, the peptides are administered mixed with 
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a combination of DQS21/MPL. The ratio of DQS21 to MPL typically will be about 1:10 to 
10:1, preferably about 1:5 to 5:1 and more preferably about 1:1. Typically for human 
administration, DQS21 and MPL will be present in a vaccine formulation in the range of 
about 1 \ig to about 100 ug. Other adjuvants are known in the art and can be used in the 

5 invention (see, e.g. Goding, Monoclonal Antibodies: Principles and Practice, 2nd Ed., 1 986). 
Methods for the preparation of mixtures or emulsions of peptide and adjuvant are well known 
to those of skill in the art of vaccination. 

Other agents which stimulate the immune response of the subject can also be 
administered to the subject For example, other cytokines are also useful in vaccination 

10 protocols as a result of their lymphocyte regulatory properties. Many other cytokines useful 
for such purposes will be known to one of ordinary skill in the art, including interleukin-12 
(TJL-12) which has been shown to enhance the protective effects of vaccines (see, e.g., Science 
268: 1432-1434, 1995), GM-CSF and IL-18. Thus cytokines can be administered in 
conjunction with antigens and adjuvants to increase the immune response to the antigens. 

15 There are a number of immune response potentiating compounds that can be used in 

vaccination protocols." These include costimulatory molecules provided in either protein or 
nucleic acid form. Such costimulatory molecules include the B7-1 and B7-2 (CD80 and CD86 
respectively) molecules which are expressed on dendritic cells (DC) and interact with the 
CD28 molecule expressed on the T cell. This interaction provides costimulation (signal 2) to 

20 an antigen/MHC/TCR stimulated (signal 1) T cell, increasing T cell proliferation and effector 
function. B7 also interacts with CTLA4 (CD152) on T cells and studies involving CTLA4 
and B7 ligands indicate that the B7-CTLA4 interaction can enhance antitumor immunity and 
CTL proliferation (Zheng P., et al. Proc. Natl. Acad. Set USA 95 (1 l):6284-6289 (1998)). 

B7 typically is not expressed on tumor cells so they are not efficient antigen presenting 

25 cells (APCs) for T cells. Induction of B7 expression would enable the tumor cells to stimulate 
more efficiently CTL proliferation and effector function. A combination of B7/IL-6/IL-12 
costimulation has been shown to induce IFN-gamma and a Thl cytokine profile in the T cell 
population leading to further enhanced T cell activity (Gajewski et al., J. Immunol, 154:5637- 
5648 (1995)). Tumor cell transfection with B7 has ben discussed in relation to in vitro CTL 

30 expansion for adoptive transfer immunotherapy by Wang et al., (J. Immunol., 1 9: 1-8 (1986)). 
Other delivery mechanisms for the B7 molecule would include nucleic acid (naked DNA) 
immunization (Kim J., et al. Nat BiotechnoL, 15:7:641-646 (1997)) and recombinant viruses 
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such as adeno and pox (Wendtner et al., Gene Ther., 4:7:726-735 (1997)). These systems are 
all amenable to the construction and use of expression cassettes for the coexpression of B7 
with other molecules of choice such as the antigens or fragments) of antigens discussed 
herein (including polytopes) or cytokines. These delivery systems can be used for induction 
5 of the appropriate molecules in vitro and for in vivo vaccination situations. The use of anti- 
CD28 antibodies to directly stimulate T cells in vitro and in vivo could also be considered. 
Similarly, the inducible co-stimulatory molecule ICOS which induces T cell responses to 
foreign antigen could be modulated, for example, by use of anti-ICOS antibodies (Hutloff et 
al., Nature 397:263-266, 1999). 
10 Lymphocyte function associated antigen-3 (LFA-3) is expressed on APCs and some 

tumor cells and interacts with CD2 expressed on T cells. This interaction induces T cell IL-2 
and IFN-gamma production and can thus complement but not substitute, the B7/CD28 
costimulatory interaction (Parra et al., J. Immunol., 158:637-642 (1997), Fenton et al., J. 
Immunother., 21 :2:95-108 (1998)). 
15 Lymphocyte function associated antigen-1 (LFA-1) is expressed on leukocytes and 

interacts with ICAM-1 expressed on APCs and some tumor cells. This interaction induces T 
cell IL-2 and IFN-gamma production and can thus complement but not substitute, the 
B7/CD28 costimulatory interaction (Fenton et al., J. Immunother., 21:2:95-108 (1998)). LFA- 
1 is thus a further example of a costimulatory molecule that could be provided in a 
20 vaccination protocol in the various ways discussed above for B7. 

Complete CTL activation and effector function requires Th cell help through the 
interaction between the Th cell CD40L (CD40 ligand) molecule and the CD40 molecule 
expressed by DCs (Ridge et al., Nature, 393:474 (1998), Bennett et al., Nature, 393:478 
(1998), Schoenberger et al., Nature, 393:480 (1998)). This mechanism of this costimulatory 
signal is likely to involve upregulation of B7 and associated IL-6/IL-12 production by the DC 
(APC). The CD40-CD40L interaction thus complements the signal 1 (antigen/MHC-TCR) 
and signal 2 (B7-CD28) interactions. 

The use of anti-CD40 antibodies to stimulate DC cells directly, would be expected to 
enhance a response to tumor antigens which are normally encountered outside of a 
inflammatory context or are presented by non-professional APCs (tumor cells). In these 
situations Th help and B7 costimulation signals are not provided. This mechanism might be 
used in the context of antigen pulsed DC based therapies or in situations where Th epitopes 
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have not been defined within known cancer antigen precursors. 

A cancer associated antigen polypeptide, or a fragment thereof, also can be used to 
isolate their native binding partners. Isolation of such binding partners may be performed 
according to well-known methods. For example, isolated cancer associated antigen 
5 polypeptides can be attached to a substrate (e.g., chromatographic media, such as polystyrene 
beads, or a filter), and then a solution suspected of containing the binding partner may be 
applied to the substrate. If a binding partner which can interact with cancer associated antigen 
polypeptides is present in the solution, then it will bind to the substrate-bound cancer 
associated antigen polypeptide. The binding partner then may be isolated. 

10 It will also be recognized that the invention embraces the use of the cancer associated 

antigen cDNA sequences in expression vectors, as well as to transfect host cells and cell lines, 
be these prokaryotic (e.g., E. coif), or eukaryotic (e.g., dendritic cells, B cells, CHO cells, COS 
cells, yeast expression systems and recombinant baculovirus expression in insect cells). 
Especially useful are mammalian cells such as human, mouse, hamster, pig, goat, primate, etc. 

15 They may be of a wide variety of tissue types, and include primary cells and cell lines. 

Specific examples include keratinocytes, peripheral blood leukocytes, bone marrow stem cells 
and embryonic stem cells. The expression vectors require that the pertinent sequence, i.e., 
those nucleic acids described supra, be operably linked to a promoter. 

The invention also contemplates delivery of nucleic acids, polypeptides or peptides for 

20 vaccination. Delivery of polypeptides and peptides can be accomplished according to 

standard vaccination protocols which are well known in the art. In another embodiment, the 
delivery of nucleic acid is accomplished by ex vivo methods, i.e. by removing a cell from a 
subjects, genetically engineering the cell to include a cancer associated antigen* and 
reintroducing the engineered cell into the subject. One example of such a procedure is 

25 outlined in U.S. Patent 5,399,346 and in exhibits submitted in the file history of that patent, all 
of which are publicly available documents. In general, it involves introduction in vitro of a 
functional copy of a gene into a cell(s) of a subject, and returning the genetically engineered 
cell(s) to the subject. The functional copy of the gene is under operable control of regulatory 
elements which permit expression of the gene in the genetically engineered cell(s). Numerous 

30 transfection and transduction techniques as well as appropriate expression vectors are well 
known to those of ordinary skill in the art, some of which are described in PCT application 
WO95/00654. In vivo nucleic acid delivery using vectors such as viruses and targeted 
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liposomes also is contemplated according to the invention. 

In preferred embodiments, a virus vector for delivering a nucleic acid encoding a 
cancer associated antigen is selected from the group consisting of adenoviruses, adeno- 
associated viruses, poxviruses including vaccinia viruses and attenuated poxviruses, Semliki 
5 Forest virus, Venezuelan equine encephalitis virus, retroviruses, Sindbis virus, and Ty virus- 
like particle. Examples of viruses and virus-like particles which have been used to deliver 
exogenous nucleic acids include: replication-defective adenoviruses (e.g., Xiang et al., 
Virology 219:220-227, 1996; Eloit et al., J Virol 7:5375-5381, 1997; Chengalvala et aL, 
Vaccine 15:335-339, 1997), a modified retrovirus (Townsend et al, J. Virol 71:3365-3374, 
0 1997), a nonreplicating retrovirus (Irwin et al., J. Virol 68:5036-5044, 1994), a replication 
defective Semliki Forest virus (Zhao et al., Proc. Natl Acad. ScL USA 92:3009-3013, 1995), 
canarypox vims and highly attenuated vaccinia virus derivative (Paoletti, Proc. Natl Acad 
ScL USA 93:1 1349-1 1353, 1996), non-replicative vaccinia virus (Moss, Proc. Natl Acad. Set 
USA 93:1 1341-1 1348, 1996), replicative vaccinia virus (Moss, Dev. Biol Stand. 82:55-63, 
* 1994), Venzuelan equine encephalitis virus (Davis et al., J. Virol 70:3781-3787, 1996), 
Sindbis vims (Pugachev et al., Virology 212:587-594, 1995), and Ty virus-like particle 
(Allsopp et al., Eur. J. Immunol 26:1951-1959, 1996). Li preferred embodiments, the virus 
vector is an adenovirus. 

Another preferred virus for certain applications is the adeno-associated virus, a 
double-stranded DNA virus. The adeno-associated virus is capable of infecting a wide range 
of cell types and species and can be engineered to be replication-deficient. It further has 
advantages, such as heat and lipid solvent stability, high transduction frequencies in cells of 
diverse lineages, including hematopoietic cells, and lack of superinfection inhibition thus 
allowing multiple series of transductions. The adeno-associated virus can integrate into 
human cellular DNA in a site-specific manner, thereby minimizing the possibility of 
insertional mutagenesis and variability of inserted gene expression. In addition, wild-type 
adeno-associated virus infections have been followed in tissue culture for greater than 100 
passages in the absence of selective pressure, implying that the adeno-associated virus 
genomic integration is a relatively stable event. The adeno-associated virus can also function 
in an extrachromosomal fashion. 

In general, other preferred viral vectors are based on non-cytopathic eukaryotic viruses 
in which non-essential genes have been replaced with the gene of interest. Non-cytopathic 
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viruses include retroviruses, the life cycle of which involves reverse transcription of genomic 
viral RNA into DNA with subsequent proviral integration into host cellular DNA, 
Adenoviruses and retroviruses have been approved for human gene therapy trials. In general, 
the retroviruses are replication-deficient (i.e., capable of directing synthesis of the desired 
5 proteins, but incapable of manufacturing an infectious particle). Such genetically altered 
retroviral expression vectors have general utility for the high-efficiency transduction of genes 
in vivo. Standard protocols for producing replication-deficient retroviruses (including the 
steps of incorporation of exogenous genetic material into a plasmid, transfection of a 
packaging cell lined with plasmid, production of recombinant retroviruses by the packaging 

10 cell line, collection of viral particles from tissue culture media, and infection of the target cells 
with viral particles) are provided in Kriegler, M., "Gene Transfer and Expression, A 
Laboratory Manual," W.H. Freeman Co., New York (1990) and Murry, EJ. Ed, 'Methods in 
Molecular Biology," vol. 7, Humana Press, Inc., Clifton, New Jersey (1991). 

Preferably the foregoing nucleic acid delivery vectors: (1) contain exogenous genetic 

15 material that can be transcribed and translated in a mammalian cell and that can induce an 
immune response in a host, and (2) contain on a surface a ligand that selectively binds to a 
receptor on the surface of a target cell, such as a mammalian cell, and thereby gains entry to 
the target cell 

Various techniques may be employed for introducing nucleic acids of the invention 
20 into cells, depending on whether the nucleic acids are introduced in vitro or in vivo in a host. 
Such techniques include transfection of nucleic acid-CaP0 4 precipitates, transfection of 
nucleic acids associated with DEAE, transfection or infection with the foregoing viruses 
including the nucleic acid of interest, liposome mediated transfection, and the like. For 
certain uses, it is preferred to target the nucleic acid to particular cells. In such instances, a 
25 vehicle used for delivering a nucleic acid of the invention into a cell (e.g., a retrovirus, or 
other virus; a liposome) can have a targeting molecule attached thereto. For example, a 
molecule such as an antibody specific for a surface membrane protein on the target cell or a 
ligand for a receptor on the target cell can be bound to or incorporated within the nucleic acid 
delivery vehicle. Preferred antibodies include antibodies which selectively bind a cancer 
30 associated antigen, alone or as a complex with a MHC molecule. Especially preferred are 
monoclonal antibodies. Where liposomes are employed to deUver the nucleic acids of the 
invention, proteins which bind to a surface membrane protein associated with endocytosis 
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may be incorporated into the liposome formulation for targeting and/or to facilitate uptake. 
Such proteins include capsid proteins or fragments thereof tropic for a particular cell type, 
antibodies for proteins which undergo internalization in cycling, proteins that target 
intracellular localization and enhance intracellular half life, and the like. Polymeric delivery 
systems also have been used successfully to deliver nucleic acids into cells, as is known by 
those skilled in the art. Such systems even permit oral delivery of nucleic acids. 

The therapeutics of the invention can be administered by any conventional route, 
including injection or by gradual infusion over time. The administration may, for example, be 
oral, intravenous, intraperitoneal, intramuscular, intracavity, subcutaneous, or transdermal. 
When cancer associated antigen peptides are used for vaccination, modes of administration 
which effectively deliver the cancer associated antigen and adjuvant, such that an immune 
response to the antigen is increased, can be used. For administration of a cancer associated 
antigen peptide in adjuvant, preferred methods include intradermal, intravenous, 
intramuscular and subcutaneous administration. Although these are preferred embodiments, 
the invention is not limited by the particular modes of administration disclosed herein. 
Standard references in the art (e.g., Remington * Pharmaceutical Sciences, 18th edition, 1990) 
provide modes of administration and formulations for delivery of immunogens with adjuvant 
or in a non-adjuvant carrier. When antibodies are used therapeutically, a preferred route of 
administration is by pulmonary aerosol. Techniques for preparing aerosol delivery systems 
containing antibodies are well known to those of skill in the art Generally, such systems 
should utilize components which will not significantly impair the biological properties of the 
antibodies, such as the paratope binding capacity (see, for example, Sciarra and Cutie, 
"Aerosols," in Remington's Pharmace utical Sciences. 18th edition, 1990, pp 1694-1712; 
incorporated by reference). Those of skill in the art can readily determine the various 
parameters and conditions for producing antibody aerosols without resort to undue 
experimentation. When using antisense preparations of the invention, slow intravenous 
administration is preferred. 

The compositions of the invention are administered in effective amounts. An 
"effective amount" is that amount of a cancer associated antigen composition that alone, or 
together with further doses, produces the desired response, e.g. increases an immune response 
to the cancer associated antigen. In the case of treating a particular disease or condition 
characterized by expression of one or more cancer associated antigens, such as breast, gastric 
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or prostate cancers, the desired response is inhibiting the progression of the disease. This may 
involve only slowing the progression of the disease temporarily, although more preferably, it 
involves halting the progression of the disease permanently. This can be monitored by routine 
methods or can be monitored according to diagnostic methods of the invention discussed 

5 herein. The desired response to treatment of the disease or condition also can be delaying the 
onset or even preventing, the onset of the disease or condition. 

Such amounts will depend, of course, on the particular condition being treated, the 
severity of the condition, the individual patient parameters including age, physical condition, 
size and weight, the duration of the treatment, the nature of concurrent therapy (if any), the 

10 specific route of administration and like factors within the knowledge and expertise of the 
health practitioner. These factors are well known to those of ordinary skill in the art and can 
be addressed with no more than routine experimentation. It is generally preferred that a 
maximum dose of the individual components or combinations thereof be used, that is, the 
highest safe dose according to sound medical judgment. It will be understood by those of 

IS ordinary skill in the art, however, that a patient may insist upon a lower dose or tolerable dose 
for medical reasons, psychological reasons or for virtually any other reasons. 

The pharmaceutical compositions used in the foregoing methods preferably are sterile 
and contain an effective amount of cancer associated antigen or nucleic acid encoding cancer 
associated antigen for producing the desired response in a unit of weight or volume suitable 

20 for administration to a patient The response can, for example, be measured by detennining 
the immune response following administration of the cancer associated antigen composition 
via a reporter system by measuring downstream effects such as gene expression, or by 
measuring the physiological effects of the cancer associated antigen composition, such as 
regression of a tumor or decrease of disease symptoms. Other assays will be known to one of 

25 ordinary skill in the art and can be employed for measuring the level of the response. 

The doses of cancer associated antigen compositions (e.g., polypeptide, peptide, 
antibody, cell or nucleic acid) administered to a subject can be chosen in accordance with 
different parameters, in particular in accordance with the mode of administration used and the 
state of the subject. Other factors include the desired period of treatment. In the event that a 

30 response in a subject is insufficient at the initial doses applied, higher doses (or effectively 
higher doses by a different, more localized delivery route) may be employed to the extent that 
patient tolerance permits. 
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In general, for treatments for eliciting or increasing an immune response, doses of 
cancer associated antigen are formulated and administered in doses between 1 ng and 1 mg, 
and preferably between 10 ng and 100 ng, according to any standard procedure in the art. 
Where nucleic acids encoding cancer associated antigen of variants thereof are employed, 
5 doses of between 1 ng and 0. 1 mg generally will be formulated and administered according to 
standard procedures. Other protocols for the administration of cancer associated antigen 
compositions will be known to one of ordinary skill in the art, in which the dose amount, 
schedule of injections, sites of injections, mode of a<iministration (e.g., intra-tumoral) and the 
like vary from the foregoing. Administration of cancer associated antigen compositions to 
10 mammals other than humans, e.g. for testing purposes or veterinary therapeutic purposes, is 
carried out under substantially the same conditions as described above. 

When administered, the pharmaceutical compositions of the invention are applied in 
pharmaceutically-acceptable amounts and in pharmaceutically-acceptable preparations. The 
term "pharmaceutical^ acceptable" means a non-toxic material that does not interfere with 
15 the effectiveness of the biological activity of the active ingredients. Such preparations may 
routinely contain salts, buffering agents, preservatives, compatible carriers, and optionally 
other therapeutic agents. When used in medicine, the salts should be pharmaceutically 
acceptable, but non-pharmaceutically acceptable salts may conveniently be used to prepare 
pharmaceutically-acceptable salts thereof and are not excluded from the scope of the 
20 invention. Such pharmacologically and pharmaceutically-acceptable salts include, but are not 
limited to, those prepared from the following acids: hydrochloric, hydrobromic, sulfuric, 
nitric, phosphoric, maleic, acetic, salicylic, citric, formic, malonic, succinic, and the like. 
Also, pharmaceutically-acceptable salts can be prepared as alkaline metal or alkaline earth 
salts, such as sodium, potassium or calcium salts. 
25 A cancer associated antigen composition may be combined, if desired, with a 

pharmaceutically-acceptable carrier. The term "pharmaceutically-acceptable carrier" as used 
herein means one or more compatible solid or liquid fillers, diluents or encapsulating 
substances which are suitable for administration into a human. The term "carrier" denotes an 
organic or inorganic ingredient, natural or synthetic, with which the active ingredient is 
30 combined to facilitate the application. The components of the pharmaceutical compositions 
also are capable of being co-mingled with the molecules of the present invention, and with 
each other, in a manner such that there is no interaction which would substantially impair the 
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desired phannaceutical efficacy. 

The pharmaceutical compositions may contain suitable buffering agents, including: 
acetic acid in a salt; citric acid in a salt; boric acid in a salt; and phosphoric acid in a salt. 

The pharmaceutical compositions also may contain, optionally, suitable preservatives, 
5 such as: benzalkonium chloride; chlorobutanol; parabens and thimerosal. 

The pharmaceutical compositions may conveniently be presented in unit dosage form 
and may be prepared by any of the methods well-known in the art of pharmacy. All methods 
include the step of bringing the active agent into association with a carrier which constitutes 
one or more accessory ingredients. In general, the compositions are prepared by uniformly 
10 and intimately bringing the active compound into association with a liquid carrier, a finely 
divided solid carrier, or both, and then, if necessary, shaping the product. 

Compositions suitable for oral administration maybe presented as discrete units, such 
as capsules, tablets, lozenges, each containing a predetermined amount of the active 
compound. Other compositions include suspensions in aqueous liquids or non-aqueous 
15 liquids such as a syrup, elixir or an emulsion. 

Compositions suitable for parenteral administration conveniently comprise a sterile 
aqueous or non-aqueous preparation of cancer associated antigen polypeptides or nucleic 
acids, which is preferably isotonic with the blood of the recipient This preparation may be 
formulated according to known methods using suitable dispersing or wetting agents and 
20 suspending agents. The sterile injectable preparation also may be a sterile injectable solution 
or suspension in a non-toxic parenterally-acceptable diluent or solvent, for example, as a 
solution in 1,3-butane diol. Among the acceptable vehicles and solvents that may be 
employed are water, Ringer's solution, and isotonic sodium chloride solution. In addition, 
sterile, fixed oils are conventionally employed as a solvent or suspending medium. For this 
25 purpose any bland fixed oil may be employed including synthetic mono-or di-glycerides. In 
addition, fatty acids such as oleic acid may be used in the preparation of injectables. Carrier 
formulation suitable for oral, subcutaneous, intravenous, intramuscular, etc. administrations 
can be found in Remington 's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA. 

30 Examples 

F.Yample 1: SEREX screening of breast gast ric and prostate cancer cells 

Breast, gastric and prostate cancer cDNA libraries were established, using standard 
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techniques, and the libraries were screened, using the SEREX methodology described by 
Sahin et al., Proc. Natl. Acad. Sci. USA 92: 1 1810 (1995), and by Chen et al., Proc. Natl. 
Acad. Sci. USA 94: 1914 (1997), each of which is incorporated by reference in its entirety. 
To be specific, total RNA was isolated by homogenizing tumor samples in 4M 
5 guanidinium thiocyanate/0.5% sodium N-lauryl sarcosine/25 mM EDTA followed by 
centrifugation in 5.7 M CsCI/25 mM sodium acetate/10 uM EDTA at 32,000 rpm. Total 
mRNA was removed by passing the sample over an oligo-dT cellulose column. The cDNA 
libraries were then constructed by taking 5 ug of mRNA, using standard methodologies to 
reverse transcribe the material. Breast cancer libraries were prepared from two different 
) breast cancer patients, referred to as "MT" and "MK". Gastric cancer libraries were prepared 
from a gastric cancer patient, referred to as "YS". 

The cDNA was used to construct a lambda phage library, and 500 phages were plated 
onto XLl-Blue MRF E. coli, and incubated for eight hours at 37°C. A nitrocellulose 
membrane was then placed on the plate, followed by overnight incubation. The membrane 
was then washed, four times, with Tris buffered saline (TBS) which contained 0.05% Tween, 
and was then immersed in TBS containing 5% non-fat dried milk. After one hour, the 
membrane was incubated with conjugates of peroxidase-goat anti human IgG specific for Fc 
portions of human antibodies (1:2000, diluted in TBS with 1% BSA). The incubation was 
carried out for one hour, at room temperature, and the membrane was then washed three times 
with TBS. Those clones which produced antibodies were visualized with 0.06% 
3,3'diaminobenzidine tetrachloride and 0.015% H 2 0 2 , in 50 mM Tris (pH 7.5). Any clones 
which produced immunoglobulin were marked, and then the membrane was washed, two 
further times, with TBS that contained 0.05% Tween, and then twice with "neat" TBS. 

The membranes were then incubated in 1:100 diluted patient serum, overnight, at 4°C. 
The patient serum had been pretreated. Specifically, 5 ml samples were diluted to 10 ml with 
TBS containing 1% bovine serum albumin, and 0.02% Na 3 N. The serum had been treated to 
remove antibodies to bacteriophage, by passing it through a 5 ml Sepharose column, to which 
a lysate of E. coli Y1090 had been attached, followed by passage over a second column which 
had E. coli lysate and lysate of E. coli infected with lambda bacteriophage. The screening was 
carried out five times. The samples were then diluted to 50 ml, and kept at -80°C, until used 
as described herein. 

Following the overnight incubation with the membrane, the membrane was washed 
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twice with TBS/0.05% Tween 20, and then once with TBS. A further incubation was carried 
out, using the protocols discussed supra, for the peroxidase labeled antibodies. 

The positive clones were then sequenced, using standard techniques. Following 
comparison of the sequences to information available in data banks, clones were resolved into 

5 known and unknown genes. Some clones corresponded to previously identified human 
proteins and nucleotide sequences, and other clones have not been identified in humans 
previously, although there were related molecules found in other species. Still other clones 
represent molecules for which no related sequences were found (most clones contained very 
short sections (e.g. 25 or fewer nucleotides) that corresponded to portions of unrelated 

10 sequences). Some GenBank accession numbers representative of sequences having homology 
to the cancer associated antigen nucleotide sequences of the invention are presented in Table 
1. All of the homologous sequences are accessible in publicly-available databases by 
reference to the sequences' accession numbers provided in Table 1 . 

15 Breast cancer clones: 

The nucleotide sequences of clones derived from breast cancer patients "MT" and 
"MK" are presented as SEQ ID NOs: 1-205. Polypeptides encoded by open reading frames of 
the nucleic acid clones are presented as SEQ ID Nos: 594-829; the correspondence between 
nucleic acid molecules and encoded polypeptides is shown in Table 2. 

20 

Gastric cancer clones: 

The nucleotide sequences of clones derived from gastric cancer patient "YS" are 
presented as SEQ ID NOs:206-352 (clones beginning with "YS"). Polypeptides encoded by 
open reading frames of the YS nucleic acid clones are presented as SEQ ID Nos:830-1083; 
25 the correspondence between nucleic acid molecules and encoded polypeptides is shown in 
Table 2. 

Prostate cancer clones 

The nucleotide sequences of clones derived from prostate cancer patient "ZH" are 
30 presented as SEQ ID NOs:353-593(clones beginning with "ZH"). Polypeptides encoded by 
open reading frames of the ZH nucleic acid clones are presented as SEQ ID Nos: 1084-1 332; 
the correspondence between nucleic acid molecules and encoded polypeptides is shown in 
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Table 2. 
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Table 1: Sequence ho mologies (GenBank Accession Numbers) 



SEQ ID NO. 1 
NGO-Br-38 combined 



NM_006644.1,AF039«S95 1 |,AB003334.I,AB003333.I > D86956.1,D67017.I,D670I6;I 247807,1 NM 013550 1 

10 AE00361 1.1, AL109620.4, ACO07O49.8. AC005992.15. AC007066.4, AC0060801, ACTO9155 3 AF^716 1 

fSZSSl 1 * A 0002367 1 . AL16IS53.2, AL161539.2, AL1 17202.1, AL009l83.l0,'z97336.l AB006696 1 
AI658961.1, AW571648.I, AW474070.1. AA843693.1, AW608075.I, AW470142.1, AW572452 1 AA5430541 

A ' A,83,339 1 > AI753470.I. AI312753.I, AI803588.I, AI563996.1, AA232636.1, AW0 S796 1 AWU7W i I 

Aifiioon !' AA£35277.1, AA993280.1, AA632202.1, AA912023.1, AW627645.1, AW02705(U, AI337175 j ' 

l^i^iV'u^, 6 ? 67,1, AA'^SOe.!, AA485151.1, AI369932.I, AI250881.1, AA933881.1, AI262020.I, AI751852 1 
AI050716.I, H52653.1, AI651 186.1, AA678506.1, AA582157.1, AW628153.I, A 493255 I AW340810 1 I AI223825 1 
AW837156.1, AA136424.1, AA953645.1, AI582484.I, AI673134.1. AW820299.1, AA394027 7^8153 II nSSl 

AW604836.1,AA730742.1,AA082043.I,Z20100.1,D58216.1,AI799265.1,D29622.I AA435594 1 AA233888 1 
AA485036.I, AI6I2928.I, AI630481.1, F07487.1, AA7317I6.I AA417255. , AA80437 L 1 AA571 3 591AA465" 183 1 

AC01 1743.3, AP000635.1, AP000610.2, AC008070.3, AC022797.3. AC005506.6 AL096782 3 



45 



55 



60 



SEQ ID NO. 2 
NOO-Br-39 
MK262/T3 5' 



t^t 9 , 51 ' AB003334 «.D86956.I,NM_006644 .1, AB0O3333.I, NM 013559.1, D67016.1, U0406 I Z47807 1 
D670I7. , AB005277.1, AB005278.1, NM_01 1020.1, U2392I.1, D49482.1. ABOO 926 1, NM 0M278 1 AB023421 I 
L12723.1, AB005279.1, X67643.I, AB005280.1, AF077354.1, NM 0083001, AB023420.1 D85904 1 XBW528I I 
If SSff ISSZil AC01 ,29 °' AC009424 -2. AC022520.2. NM.013393. 1 , AaSlMS^ffrSiu 1 1 1 
AF13671 1.1. AE001434.I, AE001433.1, 249769.1, AC0248 13.1. AE003645.1. AC01 1609 9 AC004150 8 ACO048O1 
AL163244.2. AP00I699.1. AP001605.I, LI6771.1, AW820299.1, AW859988 l^SwXS' I lA^iSJ ^ C °°* mU ' 
35 AW820234.1, AW206874.1, AI0940I5.1, AA885873. J, AW820232.I, Mi0297QA^^6k1 I 
AA580595.I. H91 160.1. AA777031.1, AW608075.1.H54657.I, H64019.1. A.658961.1 H6355I 1 XASnSa 1 

^fi^!-^ 4 ^ 

f?£££ At AA8050,6 - , .' J 087W.I > F07487.1. AW63I423.1.™3090.I,N84915.I, AW630933 1 AW4740TO I 
AA166806.1, N84914.1, AI758907.1, AW103624.1, AW571648.1, AA394027.1, AI00288 1 AA094644 1 I AW391561 1 
AW362751.1. H63595.1. AW609781.1, H54656.1, AW572452.I, V 8 6085.1, AW57756Tf 

AJ39736l.l,AA334479.1,AW754210.1,AW583074.I,AI760838.1,AW578928.l,AA2I2025 1 C8I194 1 AA645750 1 
AW819755.1, AWI25594.1, AU080443.1. AA9 19208.1, AA755774.1, AA6 15363.1 , AA445 826 1 AA 1 17945 1 

AMSMi aa 8 6 ^ 7 , V SKS?. 1 ' AA6265241 ' AA079853...'W22433.», T*XSi fSSoS I AW8 39,03.1. 
AL137142.8, AC01550I.3, AC021286.3, AC0O6882.2, AC068895.1, AC0551 15.2, AC0136604 AL354918 3 
A ^!^^' A ^i°^ 6 - 4, ACOl02675 > AC008642.3, AC008484J. AC006279.6, AC006278.'6,' AC016522U! 
AC019327.4, AC021435.2. ACOI 1301.4. AF216669.1, AL159973.2, AL034557.7, ^ 

50 SEQ ID NO. 3 
NGO-Br-39 
MK494/T3 5' 

^n?^ 5 -™. 0 ?^ 3 . 41 ' D86956 1 ' Z47807 1 . NM_006644.1. AB003333.1, NM 013559.f ; D67016.I, U0406.1, 
D670I7.I.AB005277.1,AB005278.I,AB005276.1,NM 011020.1, U2392 1.1, D49482. 1 ABOO 1926 I NM 014278 1 

AC009424.2, NM_0I3393.1, AF0934I5.1, AC010852J, AF16131 1.1, AF13671 1.1 AC005516 1 AE001434 1 

ISSlSii JS!' Vrmnis/f 0 °J-n 2 l ' ' ' AC007678 - 3 ' AC*06403.3. AO»S5S m£lt, ^004668,, 
AC004879.1, AC006354.2, AC010I83.6, AC005049.2, AC0O4150.8, AC004801 .1, AF049895 I AF068862 I 
AF004739.1, AL16291 1.1, Z68341.1, AL032629.1, AL023578.1, U41009.I, Ll6mA^i 5 TfiSosn4 1 
AA777564.1. AA885873.I. AI702970.1, AI800379.I. AA580595.1. AA8050.6.1. AW63.423 I, A^S J 
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AW630933.I. H91 160.1, AI290252.1, H54657.1. H640I9.I. A1002886.1, N849I5.1 "^J 5 : 1 ' 
Aum«563 1 H54656 1 AW577563.1. N84914.1. AA094644.1, AA749004.1, H912U.I, AI758907.1, AA777031.1, 
^^'!'Ml^6\AW^lMSmi2.l, AW859988.1, AW859943.I, AW820232.1, AW820234.1. 

<: AiinM^I AW82023I 1 AW362766.1, AA555929.1, AA555921.1, AI65896I.1, AW820224.1, AW39157Z.I, 
^Ml9l'S^\^^ll/^^^.^ mM -^ AW754210.1, AW583074.!, AI760838.1 

AAotoonfii AA7SS774 1 AA615363 1 AA445826.1, AA117945.1, AI633338.1, AI203278.1, AW8195W7.1, 
A^62^ 

A^SiiS 

AC026995.2! AC018688.4, AC022758.3, AC013294.3, AC006876.1, AL1 17373.6, AL1 17335.19, AL157821.1, 



15 SEQ ID NO. 4 
NGO-Br-55 
MK225/T3 5* 



25 %%S£i i IwSS iS? AW293828.1, AW1494I3.I, AW064723.I, AW016496.1, AW008028.1 
ISaf 1 S 1AbS«?1. AI540768.I, AI538719.I, AI360009.I, AI126655.1, MtUOI. , 

30 AC060234.2, AC015958.3, AP000898.2, AP000919.2, AL121920.1 1, AL353 195.1. 

SEQ ID NO. 5 
NGO-Br-55 

SS52* J NM 005716 I, AF089816.1, AE001 104.1, AL096829.17, AJ007636.1, L38482.1, AC012467.9 AC007252.2, 

iSmuSS^U^aA t Mtm95.U AI755163... A.472081.1, AAWUJU. A 07390S ., W73036 • 
amo^ i AIB87371 1 A1032395 1 AA581812.1, AA149940.I, AA535595.1, AI085734.1, AI951003.1, AA666165.1, 
^SK" ' ^79893 I AW24402 1 R32I 0.1, Xl24 1 188.1. N64621.I. AA740666.1, AI589363.1, AW079516. . 
A A67^56 1^669841 AB«4^ 

l^SSS" ' SS^'l I AjSSal IAW193998.1, R4018I.I, AI886660.1, AA612759.1, AI867293. 1, AI4991 13.1, 

' iSSJS t SX^ll. R32109.1, AI804 8 »6.1, T30333.L R09164 1, RJ719U AA 404 222.1. 
A^M135*l! AW664565.1, AW664371.I, R33694.1, AA1 6021 1.1, AW439960. ^* ^^^^^ ^ Aa*Ai^l<j^ T16203 1 
^ii«i^n I AiioTBii i AWi«n2l8J A1370449 1 W73301.1, AI298917.1, AA1602I2.1. AA434159.1.T16203.1, 

iiSSS l^liVA^^i:^ A^«9.i. T48755,, ^^VEJSiVS ^ 
N55776 1 AW0074 13.1, AC008569.5, AC010765.2, AL157781.1, AC0078I9.7, AL355350.2 AL161646.5. AL162454.2, 
S5162i7ACT26^ 
50 AC024715J! AC023914.1, AC0I0729.3, AC010147.4, AL139253.1, AL031301.I, 

SEQ ID NO. 6 
NGO-Br-61 

« aSS f AK001625 1 AB020657.1, AK000931.1. AL137640.1, NM_016389.1, AF161553.1, AKO01273.1, 
60 So41 1 M*9mZ AA962704A, AA581961.1, Z28830.1, AI621215.1, AI560075.1 AA603342.1 AA21 1203.1 
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15 



20 



25 



a ' * ™' ' A,596266 1 « AI929865.I, AI7S0736.I, A1649320.1, AI043J96.1, AL023060.I AW630831 1 
AI3U622.1,AI314243.],AI098095.1,AI(M3182.1.AA5I121I.1,AA434721.I,AAI40498 1 AA098508 1 SSSl 
AW532477.1, AI408553.I. AW750607.1, AV2.8438.1. AI048358.1, AA458054.1, AI76349 ' 1 AV31 1575 , ' 

AWI45984.1, AV159067.1, AI607800.1, AW535768.1, AW822436.1, All 82297.1, AA313 32 1 AATWSSSM 
AI97I805.1, AV209231.1, AV207950.1, AV154324.I, AVI 18302.1. AV\7507lAAW\6SS2SAWm£l' 
AP00I803.1,AP0OC)479.2,AC027649.4,ACO12429.4,AL353692.3:^^^ 

AC067813.1, AO02I601.3. AC023659.2, AC023818.2, AC009009.2, 297201.7 AP001815 1 AC °° 8670 - 3 ' 



10 SEQ ID NO. 7 
NGO-Br-«l 
MK751/T73' 



SS^SiJ* ' ' AB020657. 1 , AK000931.1, AL137640.1, NM 016389.1, AF161553.I AK001273 I 

AE003772. , AC0O4843.1, AF00314I.1, U88180.1, AL034350.2, AP000606.I, AC006068.3, AC00603 I 2ACT06996 2 
^ 417AAW53,9J ' AC0 ^^ AP000185 I AP000283 1 

A 625041.1, A1498683.1, AA962704.1. AA581961.1, 228830.1, AI62I2I5.1 A1560075 1 AA603342 1 AA2U 203 1 
A 453000.1, AA505767.1. H29506.1, AI493.65.1, AW338106.1, AW27.945 1, A1561 182 ^357213 ^8m65 I 
AI950251 .1, AA18264I.I, AI750267.I. AW536810.1, AI893732 1, AA881079.1, AA833428 I AA759435 7 aaSto , 
AA260237.1,AI564.93.1,AA172740.I,AA837350.1 AA572435. ,AA2905^ 

AW681468.1, AW261744.1, AA638984.1, AW107357.1, AW261646.1. AA 1 70526^848235^77826 I 
AM57598.I, AI750915.I, AI596266.I, AI929865.1, AI79\>736.1, AI649320. 1 , AI043 1 96 1 i^230M 1 AW63083 1 I 
AI314622.1, AI3I4243.1, AI098W5.1, A1M31 82.1, AA51 121 1 . , AA434721 1 AA14^^^^ 
AW532477.1,AI408553.1,AW750607.1,AV218438.I.A^^ 

A ^SV AA09I45U . D581 « «. AI91 1938.1, AI548180.1, AA086929.1, A 581089.1, AW8^437 I AW2084I4 I 
AW145984.1, AV159067.1, AI607800.I, AW535768.I, AW822436.I, AII82297.1, AA313 32 1 AA799539 1 ' 

AP001803.I, AP000479.2, AC027649.4. AC012429.4, AL353692.3, AG069214.I AO)24096 !7 aSsTOI 
30 AC067813.1, AC02I60F.3. AC023659.2. AC023818.2 AC009O09.i 2972017:^8^5 1 ' AC ° 08670 - 3, 



SEQ ID NO. 8 * 
NGO-Br-57 combined; 



AF025438.1,AL050353:1,AL121924.I2,U42838.1,AL031055.I,AE0036'80.1 AC005539 1 AL12193I 10 Al non7« •> 
itSi^!" ^f 80 - 2 ' AC010889.2, NM.007050.2, AF0436vU.4, ABimS!^^^^ AL139076.2, 
AE003533.1,AE003519.1,AE003480.I,AE003422.1,AE003217.I,AE002799.1 AC004455 I ACWW3207 
A^7478.1.AC007123.1,AC005966.1,ACC>05548.1 A^ Z68335 , 

m^ 73J ' Z92M41> ALI ,05 ° 3 - 1 ' Y«8930-1. APO01687.1, APO01297.1, APW^S.ABOOSa^ " 
? ™' D,7797 • , • X79O80J ' AMWOS^-i. AB006621.I. AA701988.1, AI337332. 1 1. Xl7657« Tl AI96400? I 
AI828070.I.AI304319.1.AI760923.1,AA236789.1,AWI61742.1,AI765022.1.A19353401 aSmcS 1 ^A865602 1 
A i 7 ^«'' N66532 ,> A, «1687.1, AA916723.I, AW161135.1, W58718.1, A^St i^Tl ^05^24 f ' 
AA024685.1, AWI5225I.1, AW772254.1, AA916358.1, AA313566.1, AI336I21.I AA024784 1 I AW614505 I 
AI888263.1, N23163.1, AA007455.1, AW272790.1, AI 167263. 1 , AI283 1 04. 1 Ju!i^lAA9^l^irta 1 
AA5056I8.1, AI073755.I, AA913049.1, AI538205.I, AA670386 1, AA0073 19^352390 t15£££i f Sis 1 
4*«foSi '' AI090I62.1, AW466965.1, AA723980.1, AI808237.1, R72404.I, AI08I040.I A>^^225^ 1 AI^TOrS^ 
AA541923.1, AA532854.I, R41738.1, AA236656.1. AA928158.1. AW1 17185*1. AI63M38 1 AA016221 IAA345744 1 
^SV 1 R72405 '. AI140745.1, A1084344.1, AI079153.I, AA852227.1, AA852226 T/hsX^W^ f 

^! 21 ' AA38553U - AW427494.1, AW557853. 1. N50079.1, AI46I7I3 1. AA858049 I AW536613 1 Sl40 1 
r i ^L , -, A Mc 7 i, 339 -'' AA « 7410 '. H3050I.1, AW172462.I. RI7187.1, AI63W24.I S3T1 R7780^W43? 7 4 1 
AII98I48.1,N56244.I,AW433804.1 > AI84I918.I,H25699.1,AA003291.1,AL1361317 AL355349 1 AL138706 1 
AL050335.24. AC016073^ AC023651.2. AL354992.1, AC026285.4, AC0551 16 2. ACOI2.3I 3 ACM6756 1 ' 
AC0I2031.7, AC007953.7. AO027502J, AC026747J, AC008821.4. AC016635.4 ACTO89265 A^Si 

A ™wJ' AOT,639 ' 6,Aai6m4 ' AW13m 

SEQIDNCH* 

D26077,AJ009839.U00996,AF035621,AJ002223,-AFOI3116,X57435.AFI3440I.l AC004653 AL024473 AC00474T 
AO004453, AL023806, U36562, U64849, AF016450, AC003689, Z77652, AJ223630 ARB602 kS^AOmSS 
270687. AL034351, Z94054, AC005955, U91325, AF051917, D90054, AF039«M7 AcS 

W75604, W88219, AI390662. AA107502, AA959827, AA5625I9. AA i 39695^ AIW5854 C80964^^OB9'^f ' 

AA36800I,AA827488,AA425663,N84321,AA040741,AA084287,AA339843.AI524007 1 N73729 N75454 
AA025609, AI24435I. AA489142, AI283076, W05252, T98110, AI244357. AA659485 AI2*S 
AI659137.1, D36418, AI065185. C67420. AA. .6.98, AU000875. N981 52. CSeS^M^SSS^Ji^^l. 
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AA096046 AA095359, AA096066, N84718, AA09564I, AA247964, N83168, N86694, N83992, N87989, AA095435, 
AA093577 AA096061, AA249353, N84781, AA249712, N88018, AA092086, N55721, N88518, AA093861, AA089553, 
N84765, N56555, N84829, AA2I5908, AA093897, N88496, N84828, N89307. N84740, AA093219, N561 18, AA0933 13, 
N84016 AA090302, N86439, AA089554, N84723. AA0955 11, AA2I5911, N84562, AA247828, N85031, AA094237, 
5 N84602* N84733, N84875, N84921, N86441 , N56179, N84575, AA095475, N84721, AA096013, N55684, N55681, 
NS5768 N84S6I, AA09592I, AA095473, N84662, AA247800, N84764, N55669, N55700, N55641, N55659, N55697, 
N55639 N84859, N84874, N84722, AA249323, N85900, AA249064, N55653, N84873, AA248551. N84797, N85930, 
AA095919, AF041408, AJ241 143.1, A1483326.I, AI483209.1, A1354060, AI353169. AI617228.I, AI353694, A1483218.1, 
AI618568.1, AI618635.1, AI353159, AA660164, AI617432.1, AI617214.1, AA933363, AI616967.1, AA585825, 

10 AI6I6416.1, AI616808.I, AA933116, AU004045, AU004063, AU003308, C93848, AU061971.1, AU061862.1, W43681, 
AU012213, AU061926.1, AU062I20.1, H07848, AI353413, AU061949.1, AU062001.1, AU062063.1, AU061924.1, 
AU06I975.1, AU00I522, AU001536, AA618732, AI066886, AT00069i-,C93682, A04374.I, A27635.1, 117659, 190252, 
E08428, A17373.1, E12149, A17374.1, AR009152, 148933, Al 7063.1. E02958, A05144.I, AR007512. E08429, A14395.1, 
132196 124105, 108013, A29289.1, A25918.1, A259I7.1, A08586.1, 159710, A45792.1, E08430, E04616, 128830, 115717, 

15 A14104.1, 118794, A26449.1, A05143.I, A17()64.1,I24IH A33348.1,A22739.1,A22738.1,A23903.1J15713 

A37288. 1, E07853, A37287.1, A34041 .1, A22736.1, A20700.1, 124701. 177293. 125179, A33349.1. A20702.1. A7 1440.1. 
AR01 8093, AR018092, E12434, A50146.1, E02074, A22740.1, A21230.1, 143706, 148927, A42089.1. A18050.1, 170384, 
A26447.1. A29288.1, A21625.1, AI3387.1, A13038.1, 106961, 105487. 107816, 192483, 183451, 183450, 185513, 
A46760.I.E02073,A17370.I,A11323.1,AOI519.1.I15148,E12615, 138604. 124920. A1337I.1. A10871.I. A02710.1. 

20 105558, 109132, 190245, 146906, 144531, 124703, AI3388.1. A29286. 1. A23997.1, A21386.1, 166485 

So070, ACOW405, AC006979.2. U80443.2, AJ005821, AV01 1931.1. AA218244, AI385712, AI462105, AI428532, 
AA097078, AI120426. AA275245, AA871884. A1509894, AAI68615, AA522413, AA222085, W36524 . AA709795 

25 W08491, AA500709, AI648883.1. AA039140. AA986882. AA163074, AA45I366. AI325541, A1615295.1, A1481672, 
AI037075, AI662626.1, AA684194. AI649O00.1, AI591703.1, AU016270, AA667101, AA734486, AV013533.1, 
AA125031. AA476011, AA789350, AA497964, AI587926.1, AL048 130.1, AL047646.1. A 1564569. 1. C06476. 
AI564600.1, AI583605. 1. AA776250. AA5641 12. AA486728, AA458903, AA284505, AA744683, AA744677. A1041865. 
N35013. AI367320. AA478033, AA723251. AI161355, AA521095. AA653613, AA173528, AA160880. AA653144, 

30 W72421, AA031689, A1095313, AA744691, AI243169.I, N27658, AA909152, AI381956, AA548423, AI240491. 

AA150688, AA705238, W76280, H24935, AI290052, AA099284, AIO03O89. AI041 158. AA299485, H47593, R8748I, 
H06272, AA670014.1. H62215.T92938, F32136.1, WI5223, H28559, AA045285, H57205, AA490932. R789I9 
AAI65451, A1206471. AA370855. AA853565.1.T92716, H51597, AA831 147, H38452, T23463, AAJ76247 . TO331, 
H97605 T927I2, AA9O4909. R62767, AA568274, AA385864, AA814518, R71478. AA887917, AA833577. AA936405. 

35 T92721. R89767, R26297, AI240490. AA975607. AA975610, AA132058, T94009, R33615, AAWH^I^. 

AA523772. T27752, A1675329.1. T55025, AA340520, AA640536. AA09053I , T54861, AA459097, AA564540. H99789, 
Al 136460. AA900065. AI0IO678, AI230737, AI599236.1, AI105068. AI412I35. AA955854, AIOU 100. AA955166. 
AA597982, AI104512. E0S646, A39800.1. A39798.1, A58656.1, 109386, A37005.1, 134189 

40 X98494,^M848, D87023, AC002060.2, AF036707, AF022981 , AF125969, U76408, D8701O, AF046084, AE000127, 
AF120927.1. U08110, AF046092, Y18000.I, Y14591, U53154, U85198, D87009, AF036359, AF0287I0, U65020 
Y14592, U48809, AF057293. U90439. AE00066I. U20857, U18428. L05251, U10577, AF047659. U97079. Z93375. 
Z49908, M63783, AC005965, AC007061.2, S76016.Z68328, AL021 107, X96469, Z73905, AF030052, AC004051 

45 X16561. U67889. AL031652. S74622, AF025452, Z95 1 1 3, M37I29, AC005927, U40375 AB026647J AB027513.1 

X06862, M60858, M17571. APO00076.1, X06856, AB01 1 164, U58652, L05904, W75630, W64795, W33952, AA388279. 
AV020965.1, AV024242.1, AA822594, AA61 1358, A1666755.1, AI664403.1, AA217227, AA274596, AA667372, 
AI661I15.1, AI551976.1, W62996, AA 103552, AA120182. AA675019. AA895561, AA183615, M020 ™>J?™™}k 
AA240412. W8I743, AA178559, AII94355. AU59163. AA968004. C85953. AA259950, AI183094, AA0I3544, W18876, 

50 AA407843, AU023725, AI508428. C87289, AA061215. AA259498. AA030860, AA1 17303, AA268047, D18368, 

AA41 4028, AA285757, AA821550, AI527508, AA967944, C86912. AA572380, W74843. AA848124, 124602^603307, 
AA159246, W73654, W01754, T92366, T93760, AI096565, AA159254. T90267, T90227, W26394, W28236 TO2399, 
AA054682, AA329899, T92391, H99855, AA131 1 15, W73607, AA147878, Z78377, A1080454, AAO" 8 ^^ 6028 ' 
T94225, T93759, T94254, T90230, AA166879. W27620. T93753. AA55 1443, T6391 1. T93558, T94665. T90616 

55 AI2731 14 TO4922, W28045, AI070777, AI13765I, C94041. AI546038, L37652, AI4069O6, AI641607. 1, R90246, T7571 1. 
AA908051, AU062784.1, C46974, AI295651, AU060842.1, AI054913, AI296000, AU033967.1, D687I5, C49179, 
AI443037. AI384793, C99888.I, AA9581 14, AI294740, C96722. AA950741. AA497303. AA495228. D36000. AA202384, 
AI044390, A1257069. A1514933. N83025, AA842900, D74159, AA952I44, AI216940. U78748. AI25345 ^ 39 34 5 
A1539928, C09348, C43842, AA651326, AI437064, A1476857, AI548859, AA941414, C96769. D35760 C09488. OI5265, 

60 AI102403, C49820. AU039361.1, AI043616, C44692. AU037725.1. AI540027.C83963.1. A50142.1, 155033, 187853 

A00764.1, 123464, AR013966, E12103, 128325, 171491, 119108, AR003567. 171490, 128591, 119102, A59205 1, E06594, 
123439, 191514. 138225, 180921, E05541, E05543. E08652, 11 1571, 109218. 168135, A46292.I, II 1583 J« 34 1092 1 9, 
112143, A22942.I, A45346.1, A30331.1, AR022373, AR022395, A30330.1, A32827.1. A58691.1. A30354.I. E05544. 
112142, E02506, A46291.1, 112873, 134431 
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SEQ1DNO:70 

X98494, AI386428, AAI62I48, AA213194, AA881872, AI647220.1, AA590060, AI326008, AA6I9205 AI284853 
AI222419, AA992199, AI681988.1, AA992130, AA025657, AI087795, AJ263606, AA0833I4, AI094541 AA847842 
5 AA731098, AA047545, AI420376.1, W80758, AA770202, A1357730. AA909134, AI271912, AA810790 N68965 ' 
T97061. AI056034, AA668325, AA5041 13, AA3471 16, AI2443J5, AA837327, Z25156, AA888598, H8880I F00393 
AI679289.1, AI679865.1, AI000365, H89025, T96950, AA916136, C02251, AI177638, AI406906, AA817668, AA901350, 
A 1483281. 1, 156746 * * 

10 SEQIDNO:7l 

AF035606, U37573, U491 12, AF082186.2, AF053408, X53937, AJ007829, AD001531, D50010 D78345 Y09813 
U51 1 13, L08785, U02430, M7781 1, U43955, U41513, AF053407;* 16359, U43957, U39779, U43956, U02437 U703 1 1 
X52326, L08786. U141 19, L08784, L08787, U02449, U43954, L26977, U02457, AJ23477I.1 L08782 I AF053409 
, < JS^f" U ° 1668, L08782, AF054625 ' M29362, U69698, X52324, M68946. AF128862.I, Z29589, AF038666, M29363 
JSi^^^^^f 078810 '* 983 * 3 - X82 ' 90 ' AB0IS619.I, UU121, U141 16, U84006 AF005420, 

AJ005339, AJ005324, AJ005323, U89927, L08874, Z32836, AF1 1 8920.1. U47102, AF041426 Z47173 U47103 
X65334.1, X65316.1, X65314.2, X65306.I.Z47I59, X65315.I, U25268, U25272, AF092546, X65309.1, X65308 1 
X6531 I.I, X65307.I, X653I0.1, X65313.1, AF092940, X65304.1, U03442, U03440, U03435, U03438, U03436 U24178 

Kl S?iSi > JS21- ^ 297,781 AF045432 « U39066 ' AA855573 « AA959713 ' AI646046:i, AI415428, ^1 W76 * 
20 AI324822, C85464, AV0I2020.I, AU0I9447, AV028650.1. AV03387I.I, W36252, AV044086.2. D77020 AA213236 

A i^'fn 99 ^i^•, A,42979, • AV0 »H'->. AI3I6230, AI573864.I, AI661543.1, AA254914, AI132558. AA168802 
AA710186, AF093453, AI315071, AA88I3I6, AII97054, AI256149, AI527428, AA238081, AI152451 AI132585 
AA238390, AIJ21035, AI099353, W13766, AI157500, AA871944, A1132345, AA825538. AI522238.1,' AI57208o'l 
AA831357, AI360561, AA77526I, AII40796, AA835492, AI361820, AA100279, AI277I90, AI469550.1 AI015234 
25 AA581345,D20022,AA122332,AI355770,AA485257,AA092467,AI4718I7,T34498,AI597962.1 AJ624976 1 ' 
CI4723, D57491, D55233, CI6300. C16305, AI541540, AJ526201, D61254, AI540858, A1557252.1 AI526143 " ' 
^HVJ*' a,'*™' ^! 2 S' 1 * A155727I.1, AI556966.1. AI540876, AI546949, AI546984. A1557859.1, AIS57866.1. 
AI541513, AI547042,AI54I1I0, AI540926.Z28355, AA585101, AI541528, A1526059, AI541522 AI557850 I 
AI525316,AI54I541,AI525306,AI541523,AI541396,AI526150,AI526092,AI526I98,AI55786I.I AI557863 1 
30 AI546875, AI546896, AJ546940, AI541530, A1546837, AI557856.1. AI54I521, AI547171, AI525296 A1S4I390 ' 

^lltUll' T41289 ' AI547165 - AI546999, A1547080, AI547I47, AI525431. AI541537, AI5469I7, D57186, AA174I70 
a »««' A ^. 439 . , , A /. 541374, R29 ' 77 ' A1525204, R29218, A1541535, AI526045, AI55773I.I, D53447, AI557799.'l, 
AI525556.CI6293, AI541307.C15069. AA660699, AA751703, AJ058193, AA441129, AA695353, AA696974 
AA821091, AA8I66I0, AA697069, AJ021792, AA694821, AA696904, AA803031, AA695217, AA802872 AA753798 
AA ^ 2 ^!' AA7Sl523 ' AA696639, AA698I05, AA525623, C94908, AA754494, A1215223, AI374340. AI2I5263 ' 
AA75 1 866, A A75 1952, AI374376, C065 1 1, AA754497, AA75 1 69 1 , AA752525, N98067, AI563441 .1 AA52558 1 
AA052885, AT001210, AI2I5226, AA751833, AA75I907, AI507946, AI563445.1. AA803406, AA735727 AI215264 
^i 52 , 1 . 5 : A A75 1591, AA75 1 592, AI096I52, AI2I52I0, AA752541, AA695930, AA697684, AA802970, A1374225 ' 
» A »™ 3 '^™ 0, AA7SX93A - D43391, AA694838, AA735768, AA697195, AA698716, AA750248, AA080594, 
) AI02I791, AA539955, AA75I578, AA752036, AA750I87, AA75I554, T0O697, AA751859, AA696794 F20138 R46884 
AI180308, AA956495, AA7532I3, AA8I9488, AA8I8950, AA851 164, AA439637. AI014075, C06771, AA751850 
AA752256, AA752020, AA566697, AA961326, AA754338, AA751678, T24259, AA080579, T15066 AA751446 ' 
AA754399, AA751589, AA754496, 118794, A25909.1, 169323, 169324, 163120, A20702.1, A43188. 1 ' A20700 1 ' 
A43I89.1, 184553, 184554, 105558. A64973.1, 170384, E03627, A60210.1, 106859, A18050.1, 148927. A601 11 V 
A6021 I.I. A23633.1. A02712.I, A60212.1, AI8053.I, AR007512, A23334.1, A60209.1, 113349, A1036I.1, 149955 
All 178.1, E01007, EI3740, AI 1624.1, E00609. AI 1623.1, 100682, 138604. A70869.1. 144681. A04664.1 A02196 1 
A 2S^ 1, A35537J » A04W3 '. A02195.I, A58522.1, 162368. A13393.1, A13392.1, A027I0.1, EI2615, 103331, 144516 
A70040.1,I21869,A07700.1,I28266,14453I,I49890,II352I,A27396.I,I52048,I21I70,A58523.1 115717 115718 ' 
A58524.I, A24782. 1 , A24783. 1 , A 1 1245. 1,11 8895, 133154, A70872.1, 108396. 108389, 10805 1 . 160242, 160241 126927 

IS; IS 108395 E2' iSr 515, A22738 1, 166494, ,66495, 166487, E00697, E00696, A20699 • ,, I0912 ^ 

SEQ ID NO:72 

AF035606. U491 12, AC004485. U73627, AC000389, AE001 146, D89223, AC004923, AC0O0385 U03396 Z95397 

A m™ 3, 1 AA ,V^ 76, A" 15428 - AI646046.1. C85464. AV012020.1, AV028650.1, AA855573, AU0I9447, 
??l^ A ^l^ mA5 "' A,3,623 °' A'573864.1, AJ197054, AI661543.1, AA88I316, AA825538, AI522238.I, 
AA831357, AI572080.I , A1360561, AA77526I, AII40796, AI361S20, AA835492, AA1002797"AI277190 AI469550 1 
AI015234, AA581345, D20022, AA122332, AI355770, AI471817, AA485257, AI597962.1, AA092467 T34498 
tltltllV,' i^^ 3 , 23 ^ 39, N663M> A1583131.1, AA235383. AA749042, AA424515. AA918245, AI251010, 

^loH 'o N21277, N75967, AI538241, AA747919, AA836065, AA555024, AA829834, AI2921 14, N32584, AI29I299 
T06835, AA851 164, AA819488, AI014075, AA8I8950, AA956495, AII80308, AU060744.1, AMI 1437 AI137403 
AI408578, AI230355, AJ41 1858, 157316, A10265.1, A07704.1, 182512, A264I5.I 
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AF089816 AF032120, AF089817, AF089818, AF061263, AF104358, AA396587, AA839164, AI645842.1, AA259652, 
AI462731 AA396061, A1508747, T25830, W06974, AA300306, AA158704, AA702414, AI335709, AA974969, 
AI193578 W87364, N39553, AI669881 .1, AI424712.1, AA513461, R10174, AA565967, H25130, AA468577, AA367767, 
AI418022!l, H49150, AI369600, T52003, AU056473.1, ARO 12064, 187064, 184560, 165545 

5 

SEQIDNO:74 

AF028824 AF089816, L38482, AE001 104, AL033502, AC005757, U34830, AC004076, Zl 1490, AJ222796, AL022069, 
Z75543, AC006056.2, AC006508, AA285636, AA71 1082, W64914, W20880, AAI72932, W09810, AA793773, W62886, 
AV026607.1, WI8328, AA73458I, W53794, AA674963, AA175523, AA222652, AA286301, AA067058, AA727901, 

10 AA692352, AA608460, AI429553, AA216860, AAI63431, W97171, AI286788, AA450721, AA273318, AA267167, 
AA203863, W98132, AA733321, W61471, W10683, AV042841.2, AI662163.1, AA691246, C76145, AA597140, 
AA178040, AV041261.2, AA929I00; AI586270.1, AI324366, AA5I2291, AA638455, AA217257, AA138312- ' 
AV040396.2, AA869209, AA822151, AA7I0587, AA689043, AA178370, AA152940, AA122810, AA571067, AA529128, 
AA276820, W53574, AA175721, AA146102, AA137743, AI528673.1, AI504720, AI464870, AA616238, AA183539, 

15 AA172429, AA170209, AA145390, AI325168, AA616060, AA267185, AA139433, AA125142, AA688934, AA689641, 
AA267963, AA22I937, AA220370, AA170764, AI530662, AA733250, AI660895.1, A1472081.1, AA781474, A1073909, 
W73036, AI032395, AA581812, AA149940, AA535595, A1085734, AA666165, AA579893, AI624402.1, R321 10, 
AI241 188,N64621, AA740666, AI589363.1, AA677956, AI343472.1, AA878576, AI634734.1, A1423229.1, R50716, 
AI683679.1, AA705739, H64249, AI272198, AI654473.1, AA325291, AI672928.1, R40181, AA612759, AI4991 13.1, 

20 AA404606, A1270050, A1056166, AA995431, AI289585, T54484, AI218312, AA918644, R33590, R32109, T30333, 
R09164, R77191, AA404222, AA304135, R33694, AA16021 1, AA320369, AA135772, AA135729, AI392813.1, 
AI370449, W73301, AI298917, AAI60212, T16203, AA434159, N78888, AA295659, T48755, AA887316, N55776, 
AI245392, AI366949, T25831, AI278660, AI364244, H64248, AA150525, N79950, R09267, R8021 1, AA157962, T73936, 
AI597799.1, R98601, AA374943, H50728, AI298607, AA423820, AA622465, AI569836.1, R55012, H44307, AA282809, 

25 AA419079, AA206818, AA37180I, AI090123, AA506994, AI298776, AA436972, T82181, F22959, A1228589, AI013903, 
AI562315.1, AA697640, A70195.1, 125849, 178457, 162859, 108631, E01324, 108638, 115551, 107396, A44968.1, E08433, 
142577, A68700.1, E01495, 168738, A08267.1, A45357.1, A46785.I, A23164.1, A42378.1, A45334.1, A08269.1, 
A65720.1, AR022391, 180845, A45372.1, AR022409, E06904, A467 18.1, 114085, AR02241 1, 112883, AR022381, 
A08862.1, 113029, 150851, A46720.1, AR0224IO, A45340.1, A70680.1, 180847 

30 

SEQID NO:75 

X03205, U13369, M10098, K03432, X82564, XO0686, Ml 1 188, X01 1 17, V01270, X06778, K01593, X00640, D84514, 

X04025, X59734, M97576, X59733.1, M91 180, API 15860, X02995, J00999, KOI373, X98843, M91 182, M91 181, 

M91 179, M91 183, X98841, X98846, LI 1288, X98844, AF102857, U87963, X98840. X98837, D50494, X98842, X98838, 

35 X98836, X98839, X98845, AF030250, M33066, M59402, L24123, M97575, M59384, M97573, M59393, M59401, 

M59392, M59386, AF021880, M59385, M59391, M59387, M59399, M59397, AJ007613.1, L81946, X70210, AJ007614, 
U50968, AF062955.1, U08333, X8163I, X80233, U08327, LI 1230, L11266, L11270, U93555, X79877, AF025946, 
U36271, AF062954.1, U19519, AF062950.1, X91974, U12647, LI 1269, M59388, M59396, AF062964.1, AF062947.1, 
X87985, U88337, LI 1267, AF1 03730, AF099943, LI 1268, Z83753, Z80955, AF021879, AF099942, U08325, U08331, 

40 U08329 M59390, U67324, AA409121, AU023662, AA407434, AU020382, AA409846, AI256506, AA914790, C86941, 
AI132252, AU035733, AU021041, C87199, AI322276, W20927, D19503, A1324724, AA1 14639, C88357, AA895334, 
AI546975, AI547168, AA090106, AI547I31, A1524874, F299I3.1, F27302.1, F24428.1, AI547125, AI547156, AI547139, 
A1547184, AA215893, AAO92005, AA247334, AI557155.1, F27796.1, H43062, AI547I70, AA585468, AI540955, 
AI547195, AI5471 12, AA2484I7, U46270, AI547189, AA094658, AA336280, AA095372, N25575, AA502901, 

45 AA360125, AA360124, F34726.1, AA669617, AI032872, T29140, AA369101, AA585173, AI554412.1, AI635062.1, 
H46573, AI289938, AI283315, H42931, AA506222, N86367, AI354556, AA532838, N88979, H50362, H45047, 
AA094430, AA346313, U46222, H41656, H26440, AA095425, N44063, F29193.1, AA728882, AA715534, H18223, 
AA725242, H23882, AA91 1513, AA714262, AA809098, AA074261, H26690, AA360197, T94543, AA092500, 
AA484257, H96223, AA3783 12, A A3 1 8469, AI452940.1, AI547166, AA093380, T93007, AA481974, AA300605, 

50 H18966, AA361972, AA515044, AA569231, R82750, T82440, H52350, AA455819, R71893, AA728869, AA744120, 
AA360304, AA713764, H52327, AA378126, AA828929, H61354, AJ241 168.1, AA900286, AI058227, AA850573, 
Z71889 AA991 1 17, AI230423, A14088O9, AI407879, AA850888, AA850889, AA944702, A1058228, AI008416, 
AI083253, AA228229, AA052022, AA2731 17, AA057965, AA598354, AA545824, AA598355, AA280468, AA022350, 
AA570908, AA430839, AA224630, AA842564, AA257199, AA232024, AA253533, AA991043, AA514179, AA570837, 

55 AA056804, AA253514, AA406738, AA471543, AI087484, AA257416, AA056821, AA280496, AA990956, AA585660, 
AA224628, AA080798, AA842506, AA273099, AA246088, AA023868, AA47I501, AII05694, AA022420, AA514160, 
AA675823, AA231991, AA051980, AA661367, AA842135, AA598367, AA052055, AA433431, AA471489, AI083303, 
AI058023, AA417401, AA570918, AU000884, AU001347, AU000812, AU001068, AU00I61 1, AU001774, AU000888, 
AA570916, AU00I817, AU001353, AU001781, AU000838, AUO00775, AA052026, AU000916, AF091041, AU001977, 

60 AU000763, AU004020, AI082942, AA570824, AA228228, AI087453, AJ629970.1, AI621757.1, A1621756.1, AA480713, 
H52877 AA651576, AA991088, AA471400, C95447.1, AA228160, AR0221 16, A07562.1, A70359.1, EO 1 321, 169485, 
109303, 127617, 169461, A21385.1, 158610, AR015960, E01508, AR000007, 158595, 165402, AR000006, 1081 15, 116573, 
AR0I5961, 190051, A49389.1, E06998, 116572, 158596, 158609, 185654, 185656, 172268, A33044.1, A67356.1, A57359.1, 
A67368.I, A67338.1, A67269.1, A67359.1, A27396.1, A67336.1, 126929, A67265.1, E07334, 179228, E08821, A48778.1, 
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A4878 1.1,11 4456, 1 1 4452, 1 14455, 1 14450, 1 1 4453, 114451, 114454, 133579, A5 1 866.1, A67262.1, A67300. 1 A67291 1 
A67267.1 , 192570, 118898, A22424.I, A39827.1, 126928, A02365.1, 140369, A48775. 1, 167829, A48776 1 E0I246 ' ' 
A63774.1, A63776.1, 168031, E12844, 114936, 125006, 136197, A63954.I, E015I0, A58741.1, A48774.1, AR019620 
116901, 140370, E01315, E013 16, A58742.1, 160004, E07337, E03076, 162976, 189769, A48779.1, A48782 I 160003* 
5 A58738.1.A58740.1, AR022119, AR022118, AR022117.A60858.1, E03345 

SEQ1DN0:76 
AI477953 

10 SEQ ID NO:77 

MK3710/T3 5\ AF025438, AL024458, AC005539, X79080, Z48544, AC004455, AC005966, U63928, AC004680 
AAI37279. AA541923, AA000683, W10638, W43974, AAOOMOly AAO03291, A1461713, AA637410, AI58556o'l 
AI430072, AA759800, AA546383, AA607321, AAI 10039, AU024430, AA959647, AU024429, AU022981 AA600493 
AI430557, W15850, W58718, N32746, AA024784, AA3 13566, AA236836, AA007319, R72404, AA236656, A1090162 

15 AI630438.I, AA70I988. AA852227.1, AI337332, AI630424.1, H30501, R17187, A13043I9, AA236789, H25699 N56244 
AA865602, A1631687.1, N66532, A1076924, AA452088, AA9 16723, AA024685, AA007455, AA9I6358 AI336l'21 
AA521369, AII67263, AI283104, AA345744, AII40745, AA451907, AA995467, N23163, R77800, AA505618 
AA913049, AA385531, A1538205.1, AI073755.I, AI352390, R72405, R41738, AA670386, AA334614, AA22839I 
AI625253.1, AI637995.1, N70197, AI648548.1, AAI 48868, F08701, AA216042, AA358819, H35482, AA687041 

20 AI599140.1, AI171338, AA979853, 123866, 127840, 127838, 108198, 140308, E05224, 185624, 127866, 127845, 127844 

127842, 127841, 195863, 127839, 138154, 127837, 127830, 127829, 168296, 168289, 132039, 107691, 103244, 101972, 105124 



SEQ ID NO:78 

25 AF025438, U42838, AL03 1055, AL024458, AB009052, D17798, AB005234, AC000389, Z92844, AL032654 Z68335 
D17799, AC004680, D17797, AB006621, AC007478.1, AC004455, AF043644, AD37332, AA236789, AI3043I9 
AA701988, AA865602.N66532, A1631687.1, AA9I6723, AA024685, AA916358, A1336121, N23I63, AA007455 
AII67263, AA45 1907, AI283 104, AA995467, AA505618, AA913049, AI073755.I, AI538205.1, AA670386, AI352390 
AA680352, AA720562, AA723980, AI081O40, AA992256, AI267913, AA532854, R4I738, AA928158, AA016221 ' 
30 AA345744, R72405. AI 140745, AI079153, AA852226.1, H89982, AA385531, AI539552, AA236836, N50079, AI090162 
AA858049,AI678339.1,A1678340.1,R77800,AU98148,H30501, AA024784, T26930, AI630424. 1 AI630438 1 
N32746, W58718, AA3 13566. AA765777, R72404, H25699, AA827898, AA828343, R16I94, AA620328, AA73I868 
AA807325, AI122832, AA883479. AA906396, AI082866, AI128465, AA443098, AA478302, N22400, R62948 
AI683470.1, T36141, AA382667, A1218567, 240072, A1028209, AA70668I, AA007319, AI022083, W87682 AI131464 
A1624085.1, AA835706, H68927, R33458, AA280829, H02184, R76465, R79700, F03294, AI076924, AA515913 
AI4617I3, AA637410, AA546383, AI585560.1, AA541923, AA959647, AU024430, AU022981, AA137279, AA60732I 
AU024429, AI505865, AA000683, AA165954, AV037726.1, AI462603, AI414233. AA675510, WI0638, C80158 
AA396049, AA589236, AI4271 15, AV044619.2, AU018321, AU042469, All 17767, AU042619, AJ599140.I, AA924460 
AA963706, AIO07935, AA80I0I2, AI 1 03628, AI232289, AA998746. AII775I8, AI599208.1. AA140989, AA140898 ' 
AI229404. All 12065, AI532103, AI512758, D75940, AA098741, 123866, E05288, E06690, E03372, A06409.1, A46255 1 
115824, E13276. 140308, A58268.1, A401 16.1. E07277, 134427, 185624, A28104.1, E05467 

SEQIDNO:79 

AJ0I0841. AF045432, AFI03726, S78798, AJ0O4935, U48696, AJ010903, YI7148, 297178, AF039698.I, U66300 
U37573, AF032922.I, U39066, AF027174, AF030515. Z49980, AF033097. AF061786. YI5421. AJ001 103, G29058 
G29060, U65376, AF033565, U34048, U52868, AF147449.1, AF033096, S83098, X99051, U44386. AF079586, X65215 
X99055, X80I64. S65683, S65686, S65693, S65694, AJ223292, X65335.1, X99568, X70958, S83538, M24488, X64409* 
M22I35, AF027I26, X65320.I, M80484. W73086. AA307I54, W58564, AA363862, TO6444, AA452335, H78479 
R63I23. T36308. Fl 1379, H59799. W15560, N76641, T83390, F07471, T83556, N24488, H17884, AA293188, AI541284 
TI0785, AA157103, W01696. N89520, H58760. N88782. N83991, AA247964, N83168. N83992, N84048, AA247827, 
N88601, AA096046, N84855, N84718. N83993, H96310, AA095359, N84712, AA471338, N86694, W23637, N84830 
AA096066, AA093224, AA095641, N55698, R84921, N83229. AA093861, N8851 8, N87989, N8801 8, AA249712. 
AA089553, N87898, N56555, N84829, AA2I5908, N88496, AA095435, N84828, AA247965, N8478I, AA093897, 
N561 18. AA093577. AA092086. N89307, N84016. N84721, AA096061, AA249353, N55721, AA089554, AA094237 
N84602, N84723, N84733. W14808, AA124I89, AA423088, AA086801, AI12I283, AAI 19742, AA222785, AA985756 
AA2I8282, AA71 1 181, W33933, AA815685, AA273544, AA238334, AA000754, W9090I, AI316625, W85535, W36243 
AI59S622.I; A5AOS0409, "AAI 06608, AA2I7769, All 19458, AA390040,- / AA879757. AA879644, W571 89; AA220693 
AA1205I5, AA623076, AA939357, W16243, AA9I4937. AA009010. AA674174, AA536703, W7688I, AA667299, ' 
AA048263, AA066010, AAI 17786, AA561056, AA172553, AA177257, AA929573, WI6I54, AA198255, AA833367 
AA8226I5, AA1404I2, AA049167, AA545088, AI56I434.1, AA212687. AA222090. AA867450, AF093453, C82658.I 
C83514.1, AF04I408, AI483326.I. AI483209.I, AI354060, AJ24 1 143.1, AI6 17228.1, AI4832I8.I, AI353694, AI353I69 
AI618568.I, AI618635.1, A1353I59, AI617214.1, AI6I7432.I, AI616808.1. AI616416.I, AI616967.I, AA933116, 
AI6I7405.1, AA866363, AB53413, AI353166, AA933363, AA660164, A1483I20.1, AI483354.1, AI353794, AU061862.I 
AA660I65. H07848, AU061924.I, AU06I949.1, AU062I20.I, AU062001.1, AU062063.1, AU061 975.1, ATO0069I, 
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AC006355.3, AC006045.2, U48386.1, AF044083.1, AL161578.2, AL021633.2, 270270.1, AL080283.1, AL163233.2, 
AL1 63224.2, 274696. 1, U41993.1, D828 13.1, AP00 1 679. 1 , AP001 688. 1 , APOO 1 506. 1 , AP0O096 1 .2, AC005522.2, 
AC008929.3, AC007379.2, AE003664. 1 , AEO035O9.I t ACO 1 2654.2, AC004079.I, AC006478.2, AC004996.I, 
AC0051OO.2, AC007100.3, AC005879.3, AC007617.IO, ACO07437.16, AC0O5331.I, AC004045.1, Y18930.1, 2991 16.1, 
5 X9591 1.1, AL1 17195.1, X63956.I, AL032637. 1, AL109925.il, AL133465.30, AL132639.2, AL132766.13, AL109985.2, 
AL078644.10, AL050322.1O, AL022395.2, 282193.1, YI5880.1, L09228.1, M84227.1, AI951 1 18.1, AW373574.1, 
AA579752.1, AI989660.1, AI825717.1, AW0009I4.1, AI922499.1, AI871874.1, AA991 162.1, AV127940.1, AA736439.1, 
C60377. 1 , AA2 1 9203 . 1 , AA095 151.1, AW578955. 1 , AW4 1 8577. 1 , AW36348 1.1, AW403036. 1 , AW262 107.1, 
AV346364.1, AV322534.I, AV28287U, AI935447.1, A1902224.1, AI754384.1, AV045752.2, AI671778.I, AI591085.1, 
10 AI58390l.l,AI478844.1,AI360552.1,AI31 1562.1, All 68669.1, All 503 10.1, AI086364.1, AA828 186.1, AA746252. 1, 
AA724030.1, AA701829.1, AA280548.I, AAI5I455.1, W 1 6804. 1, N72 190.1 , AC067744.2, AC036170.2, AL157387,2, 
AC024252.3, AL353626.1, AL 162272.4, AP000776.1, ACO 1 7005.4, AC009401.2, ACOl 1254.3, AC012582.3, 
AC012551.3, AC014239.1, AC062004.2, AC068739.2, AC036209.2, AC007131.3, AC061987.1, AC027699.1, 
AC012542.4, AC012248.2, AC013152.1, APO01 133.1, AL022284.1 

15 

SEQIDNO:591 
ZH184/T7 

AL109985.2, AL031662.25, AL163282.2, AC006323.3, AC003684.1, ACOl 13103, AF2I7796.1, AC002564.1, 
AC004130.1, AC004990.1, AC008062.2, AC004987.2, AC006213.1, AF001549. J, AC004638.1, AC004087.1, 

20 AF042O9O.1, AL049709.15, AL031542.1, AL157756.2, AL133399.1, AL031224.1, AC004263.1, AC004019.20, 
AC000052.16, AC004417.1, AC010170.3, AC007957.35, AC025588.1, AC007899.3, AC004854.2, AC004875.1, 
AC006006.2, AC005412.5, AC00719I.I, AC002402.1, AL023494.12, AL137039.1, AL021808.1, AL163262.2, 
AL121601.13, AL035697.19, AL008582.1 1, AL035458.35, 293930.10, AP001717.1, AP00I410.1, AP000190.1, 
AP000159.1, AP000047.1, AP000046.1, AP000302.1, AP000557.2, D87009.1, AP000556.2, AP0001 14.1, AC004890.2, 

25 AC002310.1, AC005523.1, AF088219.1, ALO31589.10, 293023.1, AC008039.1, AC010722.2, AC025436.2, AC009087.4, 
AC009079.4, AF168787.I, AC0061 1 1.2, AC006012.2, AF039907.1, AC006312.8, AC005901.1, AC005772.1, 
AC005754.1, AC005755.1, AC004496.1, 293241.1 1, AL163223.2, U62293.1, AL031 178.1, AP001678.I, AP00I256.2, 
AB023049.1, AP000555.1, AC005072.2, AFO06752.1, AF207550.1, AL163230.2, AL121653.2, APO0t685.1, AC000004.1, 
AC007030.3, AC004821.2, AC006125.1, AL035695.I7, AJ239318.3, AP000432.4, AC005565.1, AC002115.1, 

30 AA5537I0.1, R72458.1, AI471543.1, F36273.1, AI284640.1, AI6 101 59. 1, AW1 93265.1, AI47I481.1, AI334443.1, 

AI053672.1, AA542991.1, AW673241.I, AA825357.1, AA810370.1, AA350859.I, N25296.1, AW769399.1, AW51 1743.1, 
AW276827.1, AW193432.1, AW088058.1, AL046409.1, AI688846.1, AI613280.1, A143 1303.1, AD5021 1.1, AI341664.1, 
AI061334.1, AA179136.1, A1281697.1, AW338086.1, AI358343.1, AA678436.1, AA644538.1, AA521399.1, AA521323J, 
T07451.1, AW731867.I, AW166815.1, AW162049.I, AW029038.1, A1929531.1, AI904894.1, AF 1 50222. 1, A13757 10.1, 

35 AI344844.1, AI340453.1, AI281881.I, AII33164.I, AA649642.1, AA 1 76924. 1, AA 134367.1, AA084070.1, AW339568.1, 
AFI50152.1,AI379719.1,AA77181I.I,AA4918I4.1,AA156538.1,AW276817.1,AI339850.1, AA191620.1, 
AW833903.I, AW517377.1, AI887483.1, AA664015.I, AA599920. 1, AA533725.1, AA483223.1, W79504.1, 
AW600804.1, AW517721.1, F32808.I, AI567674.1, AI168185.1, AA747472.1, AA719805.1, AA63O030.1, AA244357.1, 
N55273.I, AW303196.1, AW30I350.1, AW274349.I, AA581903.1, 

40 AL1 19691.1, A1830390.1, AI298710.1, AA970213.1, AA834713.1, AA280632.1, AA364429.1, T56472.1, AW327868.1, 
AL042853.2, AI537955.I, AA338522. 1, AL157387.2, AC010377.4, AL355887.1, AC022931.3, AL137224.3, AL354864. 1, 
AC021879.3, AC0O5973.4, ACOl 1484.2, AC026331.3, AC025175.2, AC022668.3, AC027472.3, AC012146.4, 
AC027393.3, AC023359.7, AC035I41.2, AC0I2042.9, AC021 160.3, AC021957.3, AC026397.2, ACOl 1768.4, 
AC025054.2, AC013648.3, ACOl 1844.3, AC022989.2, AC022845.2, AC017078.3, AC013733.3, AC010165.2, 

45 AL049537.36, AL136969.5, AL353715.3, AL159175.4, AL138703.2, AL136223.3, AL157372.6, AP0O0631.3, 
AC0I9222.3, AL354723.1, AC055879.2, AC016555.4, AC009122.5, AC016334.2, AC0341 19.1, AC007721.I5, 
AL136221 .8, AC027096.3, AC021 103.6, AC046165.2, AC053540.2, AC010363.5, AC009040.4, AC027709.2, 
AC009506.3, AC008531.2, AC016700.2, ACOl 1430.4, AC025935.2, AC012141.2, AC012308.4, AC018573.2, 
AC015758.3, AC022791.1, AC021661.1, AL158830.6, AL354935.3, AC016586.4, AC010649.5, ACOl 1490.4, 

50 AC008746.5, AC021634.4, AL355515.2, AC026141.3, AC009417.2, AC005910.5, AC008688.6, AC027550.2, 
AC068364.1, AC023133.2, AC026300.2, AC018560.3, AC025818.2, AC020780.3, AC011840.3, AC007799.4, 
AF129075.1, AL161728.2, AL109824.23, AP000931.2, AC064828.3, AL353720.2, AC068889.4, AC021805.3, 
AC022621.4, AC0I6688.4, AL162726.3, AC009070.5, AC021474.3, AL12I752.8 

55 SEQIDNO:592 
2H204/T3 

M33272.1, M62890.I, AC003974.2, D86074.1, NM_001231.1, AE003658.1, NM_007550.1, S73775.1, U85l95:i, 
AE000658.1, U73702.1, U29376.1, AB008674.1, 298263.1, NM_000057.1, NC 001 147.1, AC005517.6, AC008545.3, 
AE003701.1, AF2I4653.1, NM_002095.1, AC002534.1, U71 195.1, AF067418.1, AF005030.1, 272749.1, AC000379.1, 
60 U83248.1, AL137267.I, S46792.1, S67861.1, AJ006995.1, U22183.1, AJ238237.1, U39817.I, U053 14.1, 273546.1, 
274961.1, Z70678.1, X64324.1, X63469.1, D37935.1, J05080.I, M17028.1, X88900.1, AJ006966.1, AC0I3430.4, 
AF198100.1, AE003626.1, AE003455.1, AL161594.2, AL035679.1, AL035331.1, X90518.1, AL121806.2, AL030978.I, 
AL353993.I, AL355925.1, AL034558.2, X53233.1, X87371.1, X94607.1, APOO0388.1, AB009050. 1, AA3 12591.1, 
AW415958.I, AW748894.1, AW748893.1, AW748903.1, AI098848.1, AA007643.1, AU080777.1, AA084882.1, 
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A1585542.I, AA546260.1, AA263149.I, AI505847.1, AW106399.1, AA127538.I, AA285232.1, AW456026.1, 
AI588808. 1 , AI384994. 1 , AA98 1 002. 1 , AW5667 1 2. 1 , AI78775 I.I, AW536727. 1 , A W4 1 3 1 50. 1 , AV265 194.1, 
AV219084.1, AI844907.1, AI 842969.1, AV 160844.1, AV165707.1, AV124038.1, AI779552.1, AV085555.1, AV047038.2, 
AV046630.2, AI325552.I, AU045 190.1, AU01 8790.1 , AU016981.1, AU016595.I, AU016513.1, AU0I5043.I, 
5 AU0I4858.I,AA252091.1,AA197255.1,AAI08210.1,Z74637.1,F01019.1,A^ 

AW398039.1, AW397497.1, AW397370.I, AW397I41.1, AW397013.I, AW396970.1, AW396868.I, AW395825.1, 
AW395703.I, AW395679.1, AW395606.1, AW395515.1, AW225544.1, AW186387.1, AI973567.1, AI960869.1, 
AI94 1 243. 1 , AI94 1225.1, A1940836. 1 , AI88326 1.1, AL079496. 1 , AI795023. 1 , AI759696. 1 , AI748 161.1, A1748087. 1 , 
A1736054.I, A1736025.1, AI736012.1, AI406060.1, AI109316.1, AI109205.1, AI063325.1, AA902197.1, AA784297.1 

10 AA553I06.1,AA497210.l,AA466795.1,AA462438.1,AA432643.1, AA3 1 3904. 1, CI3433.1, Fl 2959. 1, AVI 85 121.1, 

C66585.1, AL138878.4, AL139095.5, AC010893.4, AL109933.21, AC025343.2, AC008760.4, AC024364.3, AC063933.3, 
AC025925.2, AC021I85.2, AC023350.1, AC0I6546.4, AG034197.2, AC021814.2, AC0262I9.1, AC016723.4, 
AC0072I8.2, AC012373.I3, AC022890.1, AGO 17288.1, AC018551.I, ACOI5313.1, AL13901 1.6, AP001075.2, 
AC025685.2, AC061978.2, AC008427.5, AC021869.6, AC02763I.2, AC026233.2, AC013364.7, AC0I3350.6, 

15 AC026232.1, AC018740.2, AC023364J, AC008050.3, AC024301.1, AC01OO03.5, AC009368.5, AC017158.I, 
AC008367.3, AC02OI24.1, AF2I5848.I, AC0O8236.3, AC017830.1, AC017944.1, AC012952.1, AL122026.2, 
AL049185.4, AC068808.4, AC037424.7, AC016634.4, AC008904.3, AC009550.3, AC021972.2, AC02O8O4.2, 
AC021305.3, AC023 103.3, AC020080.1, AC020324.1, AC012195.2, ALI3871 1.3, AL1 601 53.4, AL03 1745.7 

20 SEQ ID NO: 593 
ZH204/T7 

AC012599.8, AC004092.I, L78822.1, L04666.1, NC.O01 146.1, NM 00571 1.1, AC0059I9.1, AC005788.1, AC003036.1, 
U70312.1,AF003530.1,X74595.I,Z71448.1,L20973.1,L19930.1, AI686567.1, AW07355I.1, AA007617.1, AA702832.1, 
AA778768.1, AA127539.1, AA085379.1, F31 106.1, AW196506.1, F36537.1, AW137246.1, AW268860.I, AW582844.1, 

25 All 18179.1, AI651413.1,AW324433.1,AI465698.1,AA073164.1, AW3901O5.1,AA856137.1,AA577233.1, 

AA648320.1, AI990395.1, AA072738.1, AI904456.1, AU024036.1, AI702678. 1, AA 12753 8.1, AI904448.1, A V3 59288.1, 
A1420526.1, AI221321.1, AV2921 10.1, AI616122.1, AA693126.I, AW215056.1, AV318953.1, AI561593.1, AA153299.1, 
AA007643. 1 , AA689696. 1 , A W43 1 906. 1, AV374296. 1 , AVI 55600. 1 , ALI 38878.4, AL139095.5, AC005842.6, 
AC024410.2, AC053543.3, AC008502.4, AC0242I8.2, AC009292.7, AC055730.3, AC009362.6, AC007351.16, 

30 AC055710.3, AC025577.10, AC0242I9.7, AC024146.5, AC022265.2, AC068656.1, ACO 16639.5, AC008422.4, 
AC016632.4, ACO08914.3, AC025763.2, AC024164.2, AC023 194.3, AC034249.1, AC016441.4, AC024469.I, 
AL049 185.4 
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Table 2: Relation between nucleotide sequences and polypeptide sequences 



Nucleic acid SEQ 
ID NO 


Polypeptide 
SEQ ID NO 


Nucleic acid SEQ 
ID NO 


Polypeptide 
SEQ ID NO 


Nucleic acid SEQ 
ID NO 


Polypeptide 
SEQ ID NO 


1 


665 


199 


818 


397 


1117 


2 


678 


200 


819, 820 


398 




3 


679 


201 


821 


399 


1118 


4 


761,762 


202 


822 


400 


1119 


5 


763, 764, 765 


203 


823, 824 


401 


1120 


6 


782 


204 


825, 826 


402 




7 


783 


205 


827, 828, 829 


403 


1121 


8 


767 


206 


830, 831 


404 


1122 


9 


604 


207 


832 


405 


1123 


10 




208 


833 


406 


1124 . , 


11 


606 


209 


834, 835 


407 




12 


624 


210 


836, 837 


408 




13 


599 


211 


838 


409 


1125 


14 


776, 777, 778, 
779 


212 


839, 840, 841 


410 
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15 


780, 781 


213 


842, 843 


41 1 


1 126 


16 


802 


214 


844, 845, 846 


412 


1 127 


17 


803 


215 


847, 848 


413 


1 128 


18 


607 


216 


849 


414 




19 


594 


217 


850, 851,852 


415 


1 129 


20 


595, 596,597 


218 


853, 854 


416 


1130, 1131, 1132 


21 


- 


219 


** 


417 


1133 


22 


598 


220 


— 


418 


1134 


23 




221 


855, 856, 857 


419 


1135, 1136 


24 


600 


222 


858. 859. 860 


420 


1137, 1138 


25 




223 


861,862, 863 


421 


1139 


26 


601 


224 


864 


422 


— 


27 


- 


225 


865 


423 


1140 


28 


602 


226 


866, 867, 868 


424 


1141, 1142 


29 


603 


227 


869, 870 


425 


1143 


30 


605 


228 


871 


426 


1144, 1145 


3i 


- 


229 


872 


427 


— 


32 


608 


230 


873 


428 


1146, 1147 


33 


609 


231 


874 


429 


1148,1149,1150 


34 


610 


232 


875, 876, 877 


430 


— 


35 


611,612 


233 


878 


431 


1151 


36 


- 


234 


879, 880 


432 


1152 


37 


- 


235 


881,882 


433 


~~ 


38 


613 


236 


883 


434 


1153 


39 


614 


237 


884, 885 


435 


1154 


40 


- 


238 


886 


436 




41 


615 


239 


887, 888 


437 


1155 


42 


- 


240 


— 


438 


1156 


43 


616 


241 


889, 890 


439 




44 


— 


242 


891,892 


440 


1157 


45 


617,618 


243 


893, 894 


441 


1 158 


46 


619 


244 


895 


442 


1 1 <o 


47 


620 


245 


896 


443 


1 lAfi 
1 10U 


48 


621 


246 


897 


444 


1 lOl 


49 




247 


898 


A AC 

445 


1 \fS) 
1 IOZ 


50 






899 


446 




- 51 


623 


249 


900,901.. 


447 


1163 


52 


625 


250 


902 


448 




53 




251 


903 


449 


1164 


54 


626 


252 


904 


450 


1165 


55 


627, 628 


253 


905 


451 


1166 



DMC rw 
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5$ 
IT 



629 



254 
255 



906 
907, 908, 909 



452 
453 



1167 



58 



630 



256 



910,911 



454 



59 



257 



912,913,914 



455 



1168 



60 



631 



258 



915 



456 



61 

"62* 



632 



259 



916 



457 



1169 



260 



917,918,918 



458 



1170 



63 
"64" 



633 



261 



920, 921 



459 



117i 



634 



262 



922, 923 



460 



1172 



65 



635 



263 



924, 925, 926 



461 



1173, 1174,1175 



66 



67 



636 



264 



927, 928, 929 



462 



265 



930, 931 



463 



1176,1177 



1178 



68 



266 



932 



464 



1179 



69 

"to" 



637 



267 



933, 934 



465 



1180 



268 



935 



466 



1181,1182,1183 



71 

~72~ 



269 



936 



467 



1184 



270 



937, 938 



468 



1185 



73 



271 



939 



469 



1186 



74 



272 



940 



470 



1187 



273 



941,942 



471 



1188 



76 



274 



943, 944 



472 



77 
IT 



638 



275 



945 



473 



1189 



276 



946, 947, 948 



474 



1190 



79 

"so" 



277 



949 



475 



81 



639 



278 



950, 951,952 



476 



279 



953, 954 



477 



1191, 1192 



1193 



83 
IT 



640 



641 



280 



955, 956 



478 



281 



479 



282 



957 



480 



1194 



1195 



1196 



85 



283 



958 



481 



1197 



642 



284 



959 



482 



1198 



87 



285 



960,961 



483 



1199 



88 

IT 



643 



286 



484 



1200, 1201 



287 



962, 963, 964 



485 



1202 



90 
~9T 



644 



288 



965 



486 



1203 



289 



966 



487 



1204 



92 



645 



290 



967, 968, 969, 
970 



488 



1205 



93 



646 



291 



971,972,973, 
974 



489 



1206 



94 

"§5~ 



647 



292 



975, 976 



490 



1207 



648 



293 



977, 978, 979, 



491 



1208 
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980 






96 


649, 650 


294 


981,982, 983 


492 


1209 


97 


651,652 


295 


984, 985 


493 


1210 


98 


653 


296 


986, 987 


494 




99 


654 


297 


988, 989 


495 


1211 


100 


655 


298 


990 


496 


1212 


101 


656 


299 


991 


497 


1213 


102 


657 


300 


992 


498 


— 


103 


658 


301 


993 


499 


1214 


104 


659,660 


302 


994 


500 


— 


105 


661 


303 


995 


501 


1215 


106 


662, 663 


304 


996 


502 


1216 


107 


664 


305 


997 


503 


1217 


108 


666 


306 


998 


504 


1218 


109 


667 


307 


999 


505 


1219 


110 


- 


308 


1000, 1001 


506 


1220 


111 


668,669, 670 


309 


1002, 1003 


507 


1221 


112 


671,672, 673, 
674, 675 


310 


1004, 1005 


508 


1222 


113 


676 


311 


1006, 1007, 1008 


509 


— 


114 


677 


312 


1009, 1010 


510 


1223 


115 


680 


313 


1011, 1012,1013 


511 


1224 


116 


681 


314 


1014, 1015 


512 


1225 


117 


682,683 


315 


1016, 1017 


513 


1226 


118 


684 


316 


1018, 1019 


514 


1227 


119 


685 


317 


1020 


515 


1228 


120 


686 


318 


1021 


516 


1229, 1230 


121 


687,688 


319 


1022 


517 


1231 


122 


689, 690, 691 


320 


1023 


518 


~~ 


123 


692 


321 


1024, 1025, 1026 


519 


1232 


124 


693,694 


322 


1027, 1028, 1029, 
1030 


520 


1233 


125 


695 


323 


1031, 1032, 1033 


521 


1234 


126 


696 


324 


1034, 1035, 1036 


522 


1235 


127 


697 


325 


1037, 1038 


523 




128 


698 


326 


1039, 1040 


524 


1236 


129 


699 


327 


1041, 1042, 104J, 
1044 


jZj 




130 


700, 701,702 


328 


1045, 1046 


526 


1237 


131 


703, 704 


329 


1047 


527 




132 


705 


330 


1048, 1049 


528 


1238 


133 


706, 707 


331 


1050 


529 


1239 
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134 



708 



332 



1051, 1052 



530 



1240, 1241 



135 



709,710 



333 



1053 



531 



1242, 1243, 1244 



136 



137 



138 



711 



334 



1054, 1055 



532 



712 



335 



1056 



533 



713 



336 



1057, 1058 



534 



1245 



1246, 1247 



139 



140 



141 



714 



337 



1059 



535 



715 



338 



1060, 1061 



536 



716 



339 



1062 



537 



1248 



1249, 1250, 1251 



1252, 1253, 1254 



142 



717 



340 



1063 



538 



143 



718 



341 



1064, 1065 



539 



1255 



144 



719 



342 



1066 



540 



1256 



145 



720 



343 



1067, 1068 



541 



1257 



146 



721 



344 



1069 



542 



1258 



147 



722 



345 



1070, 1071 



543 



1259, 1260 



148 
T49" 



723,724 



346 



1072 



544 



1261 



725, 726 



347 



1073, 1074 



545 



1262 



150 



727 



348 



1075, 1076 



546 



1263 



151 



728, 729 



349 



1077, 1078, 1079 



547 



1264 



152 
153 



730, 731 



350 



1080 



548 



1265, 1266 



732, 733 



351 



1081 



549 



1267 



154 



734, 735, 736 



352 



1082, 1083 



550 



155 



156 



737 



353 



1084 



551 



738 



354 



1085 



552 



1268 



1269, 1270 



157 



158 



159 



739, 740 



355 



553 



741 



356 



1086 



554 



742, 743, 744 



357 



555 



1271, 1272 



1273, 1274 



1275 



160 



745 



358 



1087 



556 



1276 



161 



746 



359 



1088 



557 



747, 748 



360 



1089 



558 



1277 



1278 



163 



164 



749 



361 



1090 



559 



750 



362 



1091 



560 



1279 



1280 



165 



751,752 



363 



561 



1281 



166 



753 



364 



1092 



562 



1282, 1283 



167 



754,755, 756 



365 



563 



1284, 1285 



168 



757, 758, 759, 
760 



366 



1093 



564 



1286, 1287 



169 



766 



367 



1094, 



565 



1288, 1289, 1290 



170 



768 



368 



1095 



566 



1291, 1292 



171 



769 



369 



1096 



567 



1293, 1294 



172 



173 



770 



370 



1097 



568 



771 
"77T 



371 



569 
170" 



1295 



1296, 1297, 1298 
~~1299 
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175 


773, 774, 775 


373 


1098 


571 


i inn 


176 


784 


374 


1099 




1JU1 


177 


785 


375 




573 


i wv? l im 


178 


786, 787, 788 


376 


1100 


574 




179 


— 


377 


1 101 


etc 

575 




180 


789 


378 


1102 


576 




181 


790 


379 


1103 


577 


1307 


182 


791 


380 


1104 


578 


..J308 


183 


792, 793 


381 


1105 


579 


1309 


184 


794 


382 


1106 


580 


1310 


185 


795, 796 


383 


1107 


581 


1311, 1312, 1313 


186 


797, 798, 799, 
800 


384 


1108 


582 


1314, 1315, 1316 


187 


801 


385 




583 


1317, 1318 


188 


804, 805 


386 


1109 


584 


1319 


189 


806 


387 


1110 


585 


1320, 1321 


190 


807 


388 


1111 


586 


1322 


191 


808, 809 


389 


— 


587 


1323, 1324 


192 


810 


390 


1112 


588 


1325 


193 


811 


391 




JO? 


1326 


194 


812 


392 


1113 


590 


1327 


195 


813 


393 


1114 


591 


1328, 1329 


196 


814 


394 


1115 


592 


1330, 1331 


197 


815 


395 




593 


1332 


198 


816,817 


396 


1116 







Example 2: Preparation of recombinant cancer associated antigens 

To facilitate screening of patients' sera for antibodies reactive with cancer associated 
antigens, for example by ELISA, recombinant proteins are prepared according to standard 
procedures. Where gaps exist in the gene sequences represented by the clones disclosed 
herein, or where flanking sequences are desired, such nucleic acid sequences can be isolated 
according to standard procedures. For example, where 5' and 3* clones of a gene sequence are 
known, PCR primers can be designed for amplification of the nucleotide sequence between 
the clones. Flanking sequences can be isolated using procedures such as RACE PCR. Such 
sequences also can be isolated by standard hybridization cloning protocols. 

In one method of preparing recombinant cancer associated antigens, the clones 
encoding cancer associated antigens are subcloned into a baculovirus expression vector, and 
the recombinant expression vectors are introduced into appropriate insect cells. 
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Baculovirus/insect cloning systems are preferred because post-translational modifications are 
carried out in the insect cells. Another preferred eukaryotic system is the Drosophila 
Expression System from Invitrogen. Clones which express high amounts of the recombinant 
protein are selected and used to produce the recombinant proteins. The recombinant proteins 

5 are tested for antibody recognition using serum from the patient which was used to isolated 
the particular clone, or in the case of cancer associated antigens recognized by allogeneic sera, 
by the sera from any of the patients used to isolate the clones or sera which recognize the 
clones' gene products. 

Alternatively, the cancer associated antigen clones are inserted into a prokaryotic 

) expression vector for production of recombinant proteins in bacteria. Other systems, 
including yeast expression systems and mammalian cell culture systems also can be used. 

Example 3: Preparation of antibodies to cancer associated antigens 

The recombinant cancer associated antigens produced as in Example 2 above are used 
to generate polyclonal antisera and monoclonal antibodies according to standard procedures. 
The antisera and antibodies so produced are tested for correct recognition of the cancer 
associated antigens by using the antisera/antibodies in assays of cell extracts of patients 
known to express the particular cancer associated antigen (e.g. an ELISA assay). These 
antibodies can be used for experimental purposes (e.g. localization of the cancer associated 
antigens, immunoprecipitations, Western blots, etc.) as well as diagnostic purposes (e.g., 
testing extracts of tissue biopsies, testing for the presence of cancer associated antigens). 

Example 4: Expression of breast, gastric and prostate cancer associated antigens in 
cancers of similar and different origin. 

The expression of one or more of the breast, gastric and/or prostate cancer associated 
antigens is tested in a range of tumor samples to determine which, if any, other malignancies 
should be diagnosed and/or treated by the methods described herein. Tumor cell lines and 
tumor samples are tested for cancer associated antigen expression, preferably by RT-PCR 
according to standard procedures. Northern blots also are used to test the expression of the 
cancer associated antigens. Antibody based assays, such as ELISA and western blot, also can 
be used to determine protein expression. A preferred method of testing expression of cancer 
associated antigens (in other cancers and in additional same type cancer patients) is allogeneic 
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serotyping using a modified SEREX protocol (as described above). 

In all of the foregoing, extracts from the tumors of patients who provided sera for the 
initial isolation of the cancer associated antigens are used as positive controls. The cells 
containing recombinant expression vectors described in the Examples above also can be used 
as positive controls. 

The results generated from the foregoing experiments provide panels of multiple 
cancer associated nucleic acids and/or polypeptides for use in diagnostic (e.g. determining the 
existence of cancer, determining the prognosis of a patient undergoing therapy, etc.) and 
therapeutic methods (e.g., vaccine composition, etc.). 

Example 5: HLA typing of patients positive for cancer associated antigen 

To determine which HLA molecules present peptides derived from the cancer 
associated antigens, cells of the patients which express the breast and/or gastric cancer 
associated antigens are HLA typed. Peripheral blood lymphocytes are taken from the patient 
and typed for HLA class I or class n, as well as for the particular subtype of class I or class IL 
Tumor biopsy samples also can be used for typing. HLA typing can be carried out by any of 
the standard methods in the art of clinical immunology, such as by recognition by specific 
monoclonal antibodies, or by HLA allele-specific PCR (e.g. as described in W097/3 1126). 

Example 6: Characterization of cancer associated antigen peptides presented by MHC 
class I and class H molecules. 

Antigens which provoke an antibody response in a subject may also provoke a cell- 
mediated immune response. Cells process proteins into peptides for presentation on MHC 
class I or class n molecules on the cell surface for immune surveillance. Peptides presented 
by certain MHC/HLA molecules generally conform to motifs. These motifs are known in 
some cases, and can be used to screen the breast and/or gastric cancer associated antigens for 
the presence of potential class I and/or class II peptides. Summaries of class I and class II 
motifs have been published (e.g., Rammensee et al., Immunogenetics 41:178-228, 1995). 
Based on the results of experiments such as those described above, the HLA types which 
present the individual breast cancer associated antigens are known. Motifs of peptides 
presented by these HLA molecules thus are preferentially searched. 

One also can search for class I and class II motifs using computer algorithms. For 
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example, computer programs for predicting potential CTL epitopes based on known class I 
motifs has been described (see, e.g., Parker et al, J. Immunol 152:163, 1994; D'Amaro et al., 
Human Immunol 43:13-18, 1995; Drijfhout et al., Human Immunol. 43:1-12, 1995). HLA 
binding predictions can conveniently be made using an algorithm available via the Internet on 
the National Institutes of Health World Wide Web site at URL http:^imas.dcrt.nih.gov. 
Methods for determining HLA class II peptides and making substitutions thereto are also 
known (see, e.g. International applications PCT/US96/03182 and PCT/US98/01373). 
Computer software for selecting HLA class II binding peptides is also available (TEPITOPE; 
Stumiolo et al., Nature Biotechnol 17:555-561, 1999; Manici et al., J. Exp. Med. 189:871- 
876, 1999). Peptides which are thus selected can be for inducing specific CD4 + lymphocytes 
and identification of peptides. Additional methods of selecting and testing peptides for HLA 
class BE binding are well known in the art. 

Example 7: Identification of the portion of a cancer associated polypeptide encoding an 
antigen 

To determine if the cancer associated antigens isolated as described above can provoke 
a cytolytic T lymphocyte response, the following method is performed. CTL clones are 
generated by stimulating the peripheral blood lymphocytes (PBLs) of a patient with 
autologous normal cells transfected with one of the clones encoding a cancer associated 
antigen polypeptide or with irradiated PBLs loaded with synthetic peptides corresponding to 
the putative protein and matching the consensus for the appropriate HLA class I molecule (as 
described above) to localize an antigenic peptide within the cancer associated antigen clone 
(see, e.g., Knuth et al., Proc. Natl Acad. ScL USA 81:351 1-3515, 1984; van der Braggen et 
al., Eur. J. 7/nmwno/.24:3038-3043, 1994). These CTL clones are screened for specificity 
against COS cells transfected with the cancer associated antigen clone and autologous HLA 
alleles as described by Brichard et al. (Eur. J. Immunol. 26:224-230, 1996). CTL recognition 
of a cancer associated antigen is determined by measuring release of TNF from the cytolytic T 
lymphocyte or by 5i Cr release assay (Herin et al, Int. J. Cancer 39:390-396, 1987). If a CTL 
clone specifically recognizes a transfected COS cell, then shorter fragments of the cancer 
associated antigen clone transfected in that COS cell are tested to identify the region of the 
gene that encodes the peptide. Fragments of the cancer associated antigen clone are prepared 
by exonuclease JH digestion or other standard molecular biology methods. Synthetic peptides 
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are prepared to confirm the exact sequence of the antigen. 

Optionally, shorter fragments of cancer associated antigen cDNAs are generated by 
PCR. Shorter fragments are used to provoke TNF release or 51 Cr release as above. 

Synthetic peptides corresponding to portions of the shortest fragment of the cancer 
5 associated antigen clone which provokes TNF release are prepared. Progressively shorter 
peptides are synthesized to determine the optimal cancer associated antigen tumor rejection 
antigen peptides for a given HLA molecule. 

A similar method is performed to determine if the cancer associated antigen contains 
one or more HLA class II peptides recognized by T cells. One can search the sequence of the 
10 cancer associated antigen polypeptides for HLA class II motifs as described above. In contrast 
to class I peptides, class II peptides are presented by a limited number of cell types. Thus for 
these experiments, dendritic cells or B cell clones which express HLA class II molecules 
preferably are used. 

15 Example 8: Recognition of cancer antigens by cancer patient sera 

Several of the cancer antigen identified herein were tested for reactivity with sera from 
normal and breast cancer patients according to standard procedures (e.g., the SEREX 
procedure outlined above). 



20 Table 3: Serology of antigens 



SEQ 
ID NO 


Gene/Clone 


Breast Cancer 
Patient Sera 


Normal 
Sera 


1 


Br-38/HSP105 (MK) 


6/31 


0/30 


2,3 


Br-39/HSP105 (MK) 


3/31 


0/30 


4,5 


RGS-GA1P interacting protein GIPC (MK) 


3/31 


0/30 


6,7 


NS1 -binding protein/KIAA0850 (MK) 


3/31 


0/30 


8 


Opa-interacting protein OIP2 (MK) 


3/31 


0/30 


9,10 


Kinesin family protein 3B (KEF3B) (MT) 


2/31 


0/30 


11 


Endothelial-monocyte activating protein (EMAP2) (MT) 


2/31 


0/30 


12 


Unknown TOM1 protein (MT3 1 1) 


2/31 


0/30 


13 


Outer mitochodrial membrane protein 34kDa (MT) 


1/31 


0/30 
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14,15 


IPL(MK) 


1/31 


0/30 


16,17 


Mus ACF7 neural isoform (MK) 


1/31 


0/30 


18 


Cyclin D3 (MT) 


1/31 


0/30 



The data show that proteins encoded by SEQ ID NO: 1-1 2 were recognized by multiple 
breast cancer patients' sera, but not by control individuals' sera. Proteins encoded by SEQ ID 
NO: 13-1 8 were recognized by only a single breast cancer patient's sera, but not by control 
5 individuals' sera. The 



EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
10 described herein. Such equivalents are intended to be encompassed by the following claims. 
All references disclosed herein are incorporated by reference in their entirety. 

We claim: 
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Claims 

1 . A method of diagnosing a disorder characterized by expression of a human cancer 
associated antigen precursor coded for by a nucleic acid molecule, comprising: 

contacting a biological sample isolated from a subject with an agent that specifically 
binds to the nucleic acid molecule, an expression product thereof, or a fragment of an 
expression product thereof complexed with an HLA molecule, wherein the nucleic acid 
molecule is a NA Group 1 nucleic acid molecule, and 

determining the interaction between the agent and the nucleic acid molecule or the 
expression product as a determination of the disorder. 

2. The method of claim 1, wherein the agent is selected from the group consisting of 

(a) a nucleic acid molecule comprising NA group 1 nucleic acid molecules or a 
fragment (hereof, 

(b) a nucleic acid molecule comprising NA group 3 nucleic acid molecules or a 
fragment thereof 

(c) a nucleic acid molecule comprising NA group 5 nucleic acid molecules or a 
fragment thereof, 

(d) an antibody that binds to an expression product of NA group 1 nucleic acids, 

(e) an antibody that binds to an expression product of NA group 3 nucleic acids, 

(f) an antibody that binds to an expression product of NA group 5 nucleic acids, 

(g) an agent that binds to a complex of an HLA molecule and a fragment of an 
expression product of a NA group 1 nucleic acid, 

(h) an agent that binds to a complex of an HLA molecule and a fragment of an 
expression product of a NA group 3 nucleic acid, and 

(i) an agent that binds to a complex of an HLA molecule and a fragment of an 
expression product of a NA group 5 nucleic acid, 

3. The method of claim 1, wherein the disorder is characterized by expression of a 
plurality of human cancer associated antigen precursors and wherein the agent is a plurality of 
agents, each of which is specific for a different human cancer associated antigen precursor, 
and wherein said plurality of agents is at least 2, at least 3, at least 4, at least 4, at least 6, at 
least 7, or at least 8, at least 9 or at least 10 such agents. 
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4. The method of claims 1-3, wherein the agent is specific for a human cancer associated 
antigen precursor that is a breast cancer associated antigen precursor. 

5 5. A method for determining regression, progression or onset of a condition characterized 
by expression of abnormal levels of a protein encoded by a nucleic acid molecule that is a NA 
Group 1 molecule, comprising 

monitoring a sample, from a patient who has or is suspected of having the condition, 
for a parameter selected from the group consisting of 
1° (i) the protein, 

(ii) a peptide derived from the protein, 

(iii) an antibody which selectively binds the protein or peptide, and 

(iv) cytolytic T cells specific for a complex of the peptide derived from the 
protein and an MHC molecule, 

15 as a determination of regression, progression or onset of said condition. 

6. The method of claim 5, wherein the sample is a body fluid, a body effusion or a tissue. 

7. The method of claim 5, wherein the step of monitoring comprises contacting the 
20 sample with a detectable agent selected from the group consisting of 

(a) an antibody which selectively binds the protein of (i), or the peptide of (ii), 

(b) a protein or peptide which binds the antibody of (iii), and 

(c) a cell which presents the complex of the peptide and MHC molecule of (iv). 

25 8. The method of claim 7, wherein the antibody, the protein, the peptide or the cell is 
labeled with a radioactive label or an enzyme. 

9. The method of claim 5, comprising assaying the sample for the peptide. 
30 10. The method of claim 5, wherein the nucleic acid molecule is a NA Group 3 molecule. 
11. The method of claim 5, wherein the nucleic acid molecule is a NA Group 5 molecule. 
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12. The method of claim 5, wherein the protein is a plurality of proteins, the parameter is a 
plurality of parameters, each of the plurality of parameters being specific for a different one of 
the plurality of proteins, at least one of which is a cancer associated protein encoded by a NA 

5 Group 1 molecule. 

13. A pharmaceutical preparation for a human subject comprising 

an agent which when administered to the subject enriches selectively the presence of 
complexes of an HLA molecule and a human cancer associated antigen, and 
10 a pharmaceutically acceptable carrier, wherein the human cancer associated antigen is 

a fragment of a human cancer associated antigen precursor encoded by a nucleic acid 
molecule comprises a NA Group 1 molecule. 

14. The pharmaceutical preparation of claim 13, wherein the agent comprises a plurality of 
15 agents, each of which enriches selectively in the subject complexes of an HLA molecule and a 

different human cancer associated antigen, wherein at least one of the human cancer 
associated antigens is encoded by a NA Group 1 molecule. 

15. The pharmaceutical preparation of claim 14, wherein the plurality is at least two, at 
20 least three, at least four or at least 5 different such agents. 

16. The pharmaceutical preparation of claim 13, wherein the nucleic acid molecule is a 
NA Group 3 nucleic acid molecule. 

25 17. The pharmaceutical preparation of claim 13, wherein the agent is selected from the 
group consisting of 

(1) an isolated polypeptide comprising the human cancer associated antigen, or a 
functional variant thereof, 

(2) an isolated nucleic acid operably linked to a promoter for expressing the isolated 

30 polypeptide, or functional variant thereof, 

(3) a host cell expressing the isolated polypeptide, or functional variant thereof, and 

(4) isolated complexes of the polypeptide, or functional variants thereof, and an HLA 
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molecule. 

18. The pharmaceutical preparation of claims 13-17, further comprising an adjuvant 

5 19. The pharmaceutical preparation of claim 13, wherein the agent is a cell expressing an 
isolated polypeptide comprising the human cancer associated antigen or a functional variant 
thereof, and wherein the cell is nonproliferative. 

20. The pharmaceutical preparation of claim 13, wherein the agent is a cell expressing an 
10 isolated polypeptide comprising the human cancer associated antigen or a functional variant 

thereof; and wherein the cell expresses an HLA molecule that binds the polypeptide. 

21 . The pharmaceutical preparation of claim 13, wherein the agent is at least two, at least 
three, at least four or at least five different polypeptides, each coding for a different human 

15 cancer associated antigen or functional variant thereof, wherein at least one of the human 
cancer associated antigens is encoded by a NA Group 1 molecule. 

22. The pharmaceutical preparation of claim 13, wherein the agent is a PP Group 2 
polypeptide. 

20 

23. The pharmaceutical preparation of claim 13, wherein the agent is a PP Group 3 
polypeptide or a PP Group 4 polypeptide. 

24. The pharmaceutical preparation of claim 20, wherein the cell expresses one or both of 
25 the polypeptide and HLA molecule recombinantly. 

25. The pharmaceutical preparation of claim 20, wherein the cell is nonproliferative. 

26. A composition comprising 

30 an isolated agent that binds selectively a PP Group 1 polypeptide. 



27. The composition of matter of claim 26, wherein the agent binds selectively a PP Group 



WO 00/73801 



-260- 



PCT/US00/14749 



2 polypeptide. 

28. The composition of matter of claim 26, wherein the agent binds selectively a PP Group 

3 polypeptide. 

5 

29. The composition of matter of claim 26, wherein the agent binds selectively a PP Group 

4 polypeptide. 

30. The composition of matter of claim 26, wherein the agent binds selectively a PP Group 
10 5 polypeptide. 

3 1 . The composition of claims 26-30, wherein the agent is a plurality of different agents 
that bind selectively at least two, at least three, at least four, or at least five different such 
polypeptides. 

15 

32. The composition of claims 26-30, wherein the agent is an antibody. 

33. The composition of claim 31, wherein the agent is an antibody. 

20 34. A composition of matter comprising 

a conjugate of the agent of claims 26-30 and a therapeutic or diagnostic agent 

35. A composition of matter comprising 

a conjugate of the agent of claim 31 and a therapeutic or diagnostic agent. 

25 

36. The composition of matter of claim 34, wherein the conjugate is of the agent and a 
therapeutic or diagnostic that is a toxin. 

37. A pharmaceutical composition comprising an isolated nucleic acid molecule selected 
30 from the group consisting of NA Group 1 molecules and NA Group 2 molecules, and a 

pharmaceutically acceptable carrier. 
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38. The pharmaceutical composition of claim 37, wherein the isolated nucleic acid 
molecule comprises a NA Group 3 or NA Group 4 molecule. 

39. The pharmaceutical composition of claim 37, wherein the isolated nucleic acid 
molecule comprises at least two isolated nucleic acid molecules coding for two different 
polypeptides, each polypeptide comprising a different human cancer associated antigen. 

40. The pharmaceutical composition of claims 37-39 further comprising an expression 
vector with a promoter operably linked to the isolated nucleic acid molecule. 

4 1 . The pharmaceutical composition of claims 37-39 further comprising a host cell 
recombinantly expressing the isolated nucleic acid molecule. 

42. A pharmaceutical composition comprising 

an isolated polypeptide comprising a PP Group 1 or a PP Group 2 polypeptide, and 
a pharmaceutically acceptable carrier. 

43. The pharmaceutical composition of claim 42, wherein the isolated polypeptide 
comprises a PP Group 3 or a PP Group 4 polypeptide. 

44. The pharmaceutical composition of claim 42, wherein the isolated polypeptide 
comprises at least two different polypeptides, each comprising a different human cancer 
associated antigen. 



45. The pharmaceutical composition of claim 42, wherein the isolated polypeptides are 
breast cancer polypeptides or HLA binding fragments thereof. 

46. The pharmaceutical composition of claim 42, wherein the isolated polypeptides are 
gastric cancer polypeptides or HLA binding fragments thereof. 



47. The pharmaceutical composition of claims 42-46, further comprising an adjuvant. 
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48. An isolated nucleic acid molecule comprising a NA Group 3 molecule. 



49, An isolated nucleic acid molecule comprising a NA Group 4 molecule. 

50. An isolated nucleic acid molecule selected from the group consisting of 

(a) a fragment of a nucleic acid molecule having a nucleotide sequence selected from 
the group consisting of nucleotide sequences set forth as SEQ ID NOs: 1-593, of sufficient 
length to represent a sequence unique within the human genome, and identifying a nucleic 
acid encoding a human cancer associated antigen precursor, 

(b) complements of (a), 

provided that the fragment includes a sequence of contiguous nucleotides which is not 
identical to any sequence selected from the sequence group consisting of 

(1) sequences having the GenBank accession numbers of Table 1, and other publicly 
available sequences, 

(2) complements of (1), and 

(3) fragments of (1) and (2). 

isolated nucleic acid molecule of claim 50, wherein the sequence of contiguous 
is selected from the group consisting of: 

at least two contiguous nucleotides nonidentical to the sequence group, 
at least three contiguous nucleotides nonidentical to the sequence group, 
at least four contiguous nucleotides nonidentical to the sequence group, 
at least five contiguous nucleotides nonidentical to the sequence group, 
at least six contiguous nucleotides nonidentical to the sequence group, 
at least seven contiguous nucleotides nonidentical to the sequence group. 

52. The isolated nucleic acid molecule of claim 50, wherein the fragment has a size 
selected from the group consisting of at least: 8 nucleotides, 10 nucleotides, 12 nucleotides, 
14 nucleotides, 16 nucleotides, 18 nucleotides, 20, nucleotides, 22 nucleotides, 24 
nucleotides, 26 nucleotides, 28 nucleotides, 30 nucleotides, 50 nucleotides, 75 nucleotides, 
100 nucleotides, and 200 nucleotides. 



51. The 
nucleotides 

0) 
(2) 
(3) 
(4) 
(5) 
(6) 
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53. The isolated nucleic acid molecule of claim 50, wherein the molecule encodes a 
polypeptide which, or a fragment of which, binds a human HLA receptor or a human 
antibody, 

5 54. An expression vector comprising an isolated nucleic acid molecule of any of claims 
48-53 operably linked to a promoter. 

55. An expression vector comprising a nucleic acid operably linked to a promoter, wherein 
the nucleic acid is a NA Group 2 molecule. 

10 

56. An expression vector comprising a NA Group 1 or Group 2 molecule and a nucleic 
acid encoding an HLA molecule. 

57. A host cell transformed or transfected with an expression vector of claim 54. 

15 

58. A host cell transformed or transfected with an expression vector of claims 55 or 56. 

59. A host cell transformed or transfected with an expression vector of claim 54 and 
further comprising a nucleic acid encoding HLA. 

20 

60. A host cell transformed or transfected with an expression vector of claim 55 and 
further comprising a nucleic acid encoding HLA. 

61 . An isolated polypeptide encoded by the isolated nucleic acid molecule of claim 48 or 
25 claim 49. 

62. A fragment of the polypeptide of claim 61 which is immunogenic. 

63. The fragment of claim 62, wherein the fragment, or a portion of the fragment, binds 
30 HLA or a human antibody. 

64. An isolated fragment of a human cancer associated antigen precursor which, or portion 
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of which, binds HLA or a human antibody, wherein the precursor is encoded by a nucleic acid 
molecule that is a NA Group 1 molecule. 

65. The fragment of claim 64, wherein the fragment is part of a complex with HLA. 

5 

66. The fragment of claim 65, wherein the fragment is between 8 and 12 amino acids in 
length. 

67. An isolated polypeptide comprising a fragment of the polypeptide of claim 61 of 

10 sufficient length to represent a sequence unique within the human genome and identifying a 
polypeptide that is a human cancer associated antigen precursor. 

68. A kit for detecting the presence of the expression of a human cancer associated antigen 
precursor comprising 

15 a pair of isolated nucleic acid molecules each of which consists essentially of a 

molecule selected from the group consisting of (a) a 12-32 nucleotide contiguous segment of 
the nucleotide sequence of any of the NA Group 1 molecules and (b) complements of (a), 
wherein the contiguous segments are nonoverlapping. 



20 



69. The kit of claim 68, wherein the pair of isolated nucleic acid molecules is constructed 
and arranged to selectively amplify an isolated nucleic acid molecule that is a NA Group 3 
molecule. 

70. A method for treating a subject with a disorder characterized by expression of a human 
25 cancer associated antigen precursor, comprising 

administering to the subject an amount of an agent, which enriches selectively in the 
subject the presence of complexes of an HLA molecule and a human cancer associated 
antigen, effective to ameliorate the disorder, wherein the human cancer associated antigen is a 
fragment of a human cancer associated antigen precursor encoded by a nucleic acid molecule 
30 selected from the group consisting of 

(a) a nucleic acid molecule comprising NA group 1 nucleic acid molecules, 

(b) a nucleic acid molecule comprising NA group 3 nucleic acid molecules, 
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a nucleic acid molecule comprising NA group 5 nucleic acid molecules. 



71. The method of claim 70, wherein the disorder is characterized by expression of a 
plurality of human cancer associated antigen precursors and wherein the agent is a plurality of 

5 agents, each of which enriches selectively in the subject the presence of complexes of an HLA 
molecule and a different human cancer associated antigen, wherein at least one of the human 
cancer associated antigens is encoded by a NA Group 1 molecule. 

72. The method of claim 71 , wherein the plurality is at least 2, at least 3, at least 4, or at 
10 least 5 such agents. 

73. The method of claims 70-72, wherein the agent is an isolated polypeptide selected 
from the group consisting of PP Group 1, PP Group 2, PP Group 3, PP Group 4, and PP 
Group 5. 

15 

74. The method of claims 70-72, wherein the disorder is cancer. 

75. The method of claims 73, wherein the disorder is cancer. 

20 76. A method for treating a subject having a condition characterized by expression of a 
human cancer associated antigen precursor in cells of the subject, comprising: 

(i) removing an immunoreactive cell containing sample from the subject, 

(ii) contacting the immunoreactive cell containing sample to the host cell under 
conditions favoring production of cytolytic T cells against a human cancer associated antigen 

25 which is a fragment of the precursor, 

(iii) introducing the cytolytic T cells to the subject in an amount effective to lyse 
cells which express the human cancer associated antigen, wherein the host cell is transformed 
or transfected with an expression vector comprising an isolated nucleic acid molecule 
operably linked to a promoter, the isolated nucleic acid molecule being selected from the 

30 group of nucleic acid molecules consisting of NA Group 1 , NA Group 2, NA Group 3, NA 
Group 4, and NA Group 5. 
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77. The method of claim 76, wherein the host ceil recombinantly expresses an HLA 
molecule which binds the human cancer associated antigen. 



78. The method of claim 76, wherein the host cell endogenously expresses an HLA 
molecule which binds the human cancer associated antigen. 



79. A method for treating a subject having a condition characterized by expression of a 
human cancer associated antigen precursor in cells of the subject, comprising: 

(i) identifying a nucleic acid molecule expressed by the cells associated with said 
10 condition, wherein said nucleic acid molecule is a NA Group 1 molecule; 

(ii) transfecting a host cell with a nucleic acid selected from the group consisting 
of (a) the nucleic acid molecule identified, (b) a fragment of the nucleic acid identified which 
includes a segment coding for a human cancer associated antigen, (c) deletions, substitutions 
or additions to (a) or (b), and (d) degenerates of (a), (b), or (c); 

15 (Hi) culturing said transfected host cells to express the transfected nucleic acid 

molecule, and; 

(iv) introducing an amount of said host cells or an extract thereof to the subject 
effective to increase an immune response against the cells of the subject associated with the 
condition. 

20 

80. The method of claim 79, further comprising identifying an MHC molecule which 
presents a portion of an expression product of the nucleic acid molecule, wherein the host cell 
expresses the same MHC molecule as identified and wherein the host cell presents an MHC 
binding portion of the expression product of the nucleic acid molecule. 

25 

81 . The method of claim 79, wherein the immune response comprises a B-cell response or 
a T cell response. 

82. The method of claim 8 1 , wherein the response is a T-cell response which comprises 
30 generation of cytolytic T-cells specific for the host cells presenting the portion of the 

expression product of the nucleic acid molecule or cells of the subject expressing the human 
cancer associated antigen. 
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83. The method of claim 79, wherein the nucleic acid molecule is a NA Group 3 molecule. 

84. The method of claims 79 or 80, further comprising treating the host cells to render 
5 them non-proliferative. 

85. A method for treating or diagnosing or monitoring a subject having a condition 
characterized by expression of an abnormal amount of a protein encoded by a nucleic acid 
molecule that is a NA Group 1 molecule, comprising 

10 administering to the subject an antibody which specifically binds to the protein or a 

peptide derived therefrom, the antibody being coupled to a therapeutically useful agent, in an 
amount effective to treat the condition. 

86. The method of claim 85, wherein the antibody is a monoclonal antibody. 

15 

87. The method of claim 86, wherein the monoclonal antibody is a chimeric antibody or a 
humanized antibody. 

88. A method for treating a condition characterized by expression in a subject of abnormal 
amounts of a protein encoded by a nucleic acid molecule that is a NA Group 1 nucleic acid 
molecule, comprising 

administering to a subject a pharmaceutical composition of any one of claims 13-25 
and 37-47 in an amount effective to prevent, delay the onset of, or inhibit the condition in the 
subject. 

89. The method of claim 88, wherein the condition is cancer. 

90. The method of claim 88, further comprising first identifying that the subject expresses 
in a tissue abnormal amounts of the protein. 

91 . The method of claim 89, further comprising first identifying that the subject expresses 
in a tissue abnormal amounts of the protein. 
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92. A method for treating a subject having a condition characterized by expression of 
abnormal amounts of a protein encoded by a nucleic acid molecule that is a NA Group 1 
nucleic acid molecule, comprising 

5 (i) identifying cells from the subject which express abnormal amounts of the protein; 

(ii) isolating a sample of the cells; 

(iii) cultivating the cells, and 

(iv) introducing the cells to the subject in an amount effective to provoke an immune 
response against the cells. 

10 

93. The method of claim 92, further comprising rendering the cells non-proliferative, prior 
to introducing them to the subject. 

94. A method for treating a pathological cell condition characterized by aberrant 

15 expression of a protein encoded by a nucleic acid molecule that is a NA Group 1 nucleic acid 
molecule, comprising 

administering to a subject in need thereof an effective amount of an agent which 
inhibits the expression or activity of the protein. 

20 95. The method of claim 94, wherein the agent is an inhibiting antibody which selectively 
binds to the protein and wherein the antibody is a monoclonal antibody, a chimeric antibody 
or a humanized antibody. 

96. The method of claim 94, wherein the agent is an antisense nucleic acid molecule 
25 which selectively binds to the nucleic acid molecule which encodes the protein. 

97. The method of claim 94, wherein the nucleic acid molecule is a NA Group 3 nucleic 
acid molecule. 

30 98. A composition of matter useful in stimulating an immune response to a plurality of a 
proteins encoded by nucleic acid molecules that are NA Group 1 molecules, comprising 

a plurality of peptides derived from the amino acid sequences of the proteins, wherein 
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the peptides bind to one or more MHC molecules presented on the surface of the cells which 
express an abnormal amount of the protein. 

99. The composition of matter of claim 98, wherein at least a portion of the plurality of 
5 peptides bind to MHC molecules and elicit a cytolytic response thereto. 

100. The composition of matter of claim 99, further comprising an adjuvant. 

101. The composition of matter of claim 100, wherein said adjuvant is a saponin, GM-CSF, 
10 or an interleukin. 

102. The composition of matter of claim 98, further comprising at least one peptide useful 
in stimulating an immune response to at least one protein which is not encoded by nucleic 
acid molecules that are NA Group 1 molecules, wherein the at least one peptide binds to one 

15 or more MHC molecules. 

103. An isolated antibody which selectively binds to a complex of: 

(i) a peptide derived from a protein encoded by a nucleic acid molecule that is a 
NA Group 1 molecule and 
20 (ii) and an MHC molecule to which binds the peptide to form the complex, 

wherein the isolated antibody does not bind to (i) or (ii) alone. 

104. The antibody of claim 103, wherein the antibody is a monoclonal antibody, a chimeric 
antibody, a humanized antibody, or a fragment thereof. 
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SEQUENCE LISTING 

<110> Ludwig Institute for Cancer Research 

5 <120> BREAST, GASTRIC, AND PROSTATE 

CANCER ASSOCIATED ANTIGENS AND USES THEREFOR 



<130> L0461/7064WO 

10 

<150> US €0/136,526 
<151> 1999-05-28 

<150> US 60/153,454 
X5 <151> 1999-09-10 

<160> 1332 

<170> FastSEQ for Windows Version 3.0 

20 

<210> 1 

<21l> 1231 

<212> DNA 

<213> Homo sapiens 

25 

<400> 1 

gagaggacca agctaaacaa gcatatgttg acaagttgga agaattaatg aaaattggca 60 

ctccagttaa agttcggttt caggaagctg aagaacggcc aaaaatgttt gaagaactag 120 

gacagaggct gcagcattat gccaagatag cagctgactt cagaaataag gatgagaaat 1B0 

30 acaaccatat tgatgagtct gaaafcgaaaa aagtggagaa gtctgttaat gaagtgatgg 240 

aatggatgaa taatgtcatg aatgctcagg ctaaaaagag tcttgatcag gatccagttg 300 

tacgtgctca ggaaattaaa acaaaaatca aggaattgaa caacacatgt gaacccgttg 360 

taacacaacc gaaaccaaaa attgaatcac ccaaactgga aagaactcca aatggcccaa 420 

atattgataa aaaggaagaa gatttagaag acaaaaacaa ttttggtgct gaacctccac 480 

35 atcagaatgg tgaatgttac cctaatgaga aaaattctgt taatatggac ttggactaga 540 

taaccttaaa ttggcctatt ccttcaatta ataaaatatt tttgccatag tatgtgactc 600 

tacataacat actgaaacta tttatatttt cttttttaag gatatttaga aattttgtgt 660 

attatatgga aaaagaaaaa aagctttaag tctgtagtct ttatgatcct aaaagggaaa 720 

attgccttgg taactttcag attcctgtgg aattgtgaat tcatactaag ctttctgtgc 780 

40 agtctcncca tttgcatcac tgaggatgaa actgactttt gtcttttgga gaaaaaaaac 840 

ttgtactgct tgttcaagag ggctgtgatt aaaatcttta agcatttgtt octgccaagg 900 

tagttttctt gcattttgct ctccattcag catgtgtgtg ggtgtggatg tttataaaca 960 

agactaagtc tgacttcata agggctttct aaaaccattt ctgtccaaga gaaaatgact 1020 

ttttgctttg atattaaaaa ttcaatgagt aaaacaaaag ctagtcaaat gtgttagcag 1080 

45 catgcagaac aaaaacttta aactttctct ctcnctatac agtatattgt catgtgaaag 1140 

tgtggaatgg aagaaatgtc gatcctgttg taactgattg tgaacacttt tatgagcttt 1200 
aaaataaagt tcatcttatg gtgtcatttc t 

<210> 2 
50 <211> 965 

<212> DNA 
<213> Homo sapiens 

<400> 2 

55 ggagaatgaa atgtcttctg aagctgacat ggagtgtctg aatcagagac 
cccagacact gataaaaatg tccagcaaga caacagtgaa gctggaacac 
acaaactgat gctcaacaaa cctcacagtc tcccccttca cctgaactta 
aaacaaaatc ccagatgctg acaaagcaaa tgaaaaaaaa gttgaccagc 
taaaaagccc aaaataaagg tggtgaatgt tgagctgcct attgaagcca 

60 gcagttaggg aaagaccttc ttaacatgta tattgagaca gagggtaaga 
agataaattg gaaaaagaaa ggaatgatgc taaaaatgca gttgaggaat 
gttcagagac aagctgtgtg gaccatatga aaaatttata tgtgagcagg 
ttttttgaga ctcctcacag aaactgaaga ctggctgtat gaagaaggag 



caccagaaaa 60 

agccccaggt 120 

ccccagaaga 180 

ctccagaagc 240 

acttggtctg 300 

tgataatgca 360 

atgtgtatga 420 

atcatcaaaa 480 

aggaccaagc 540 
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30 



35 



40 



45 



taaacaagca tatgttgaca agttggaaga attaatgaaa attggcactc cagttaaagt 600 

tcggtttcag gaagctgaaa gaacggccca aaaatgtttg aagaactagg acagaggctg 660 

cagcatttat gcccagatag cagctgactt cagaaataag ggtgagaaat accacctttt 720 

tggatgagtc ttgaaatgaa aaaagtggga aaaatctgtt aatgaagtga ttgggaatgg 780 

attgaataat gtcttgaaag ctcaggctaa aaagaagtct tggatcaggg ntccaattgt 840 

nctgcctccn ggaaatttaa aacaaaaant cangggaatt gggacccccc attgtggaan 900 
ccgttgttac ccaaccccga aanccaaaaa ttggattccc ccccaactgg gnaaaaacct 



900 
960 
965 



10 <210> 3 

<211> 799 
<212> DNA 
<213> Homo sapiens 

15 <400> 3 £n 

attctgatcc ccaaggagtt ccatatccag aagcaaaaat aggccgcttt gtagttcaga eo 

atgtttctgc acagaaagat ggagaaaaat ctagagtaaa agtcaaagtg cgagtcaaca 120 

cccatggcat tttcaccatc tctacggcat ctatggtgga gaaagtccca actgaggaga 180 

atgaaatgtc ttctgaagct gacatggagt gtctgaatca gagaccacca gaaaacccag 240 

20 acactgafcaa aaatgfcccag caagacaaca gtgaagctgg aacacagccc caggtacaaa 300 

ctgatgctca acaaacctca cagtctcccc cttcacctga acttacctca gaagaaaaca 360 

aaatcccaga tgctgacaaa gcaaatgaaa aaaaagtfcga ccagectcca gaagctaaaa 420 

agcccaaaat aaaggtggtg aatgttgagc tgcctattga agccaacttg gtctggcagt 480 

tagggaaaga ccttcttaac atgtatattg agacagaggg taagatgata atgcaagata 540 

25 aattggaaaa agaaaggaat gatgctaaaa atgcagttga ggaatatgtg tatgagttca 600 

gagacaagct gtgtggacca tatgaaaaat ttatatgtga gcaggatcat caaaaatttt 660 

ttgaggactc ctcacagaaa actggaagaa ctggcttgtt ttgaagaaag ggagangacc 720 

aagcttaaac caagcatatg ttgacangtt tgggaagaat taatgaaaaa ttggcacttc 780 



- 700 

cagtttaagg ttcggtttc 

<210> 4 

<211> 141 

<212> DNA 

<213> Homo sapiens 

<400> 4 

cccacttctc gctgctcatg ccgctgggac tggggcggcg gaaaaaggcg ccccctctag eo 

tggaaaatga ggaggctgag ccaggccgtg gagggctggg cgtgggggag ccagggcctc 120 

tgggcggagg tgggtcgggg g 14 x 

<210> 5 
<211> 608 
<212> DNA 
<213> Homo sapiens 



<400> 5 

ggcagattga ttctttatgt tcaagacagc aaattcagat acaaaaaccc accgccatcg 60 

tccctccctc cctcctgctc tgggccaggg atgggcctgg aaggaaaaat tggaggtggg 120 

gaggaggfctg cggggttuac agcaaactcn tgtcaaatgc ggaggtaaca ggctncacag 180 

50 ggagggggct cctctcagga ggggtgaggg cattattgca tttgctgggg ggaaggacaa 240 

ccctctcccc tgtattccct gcgtcaggaa actaggaagg ncatgacccc caaacagaac 300 

ccaaggcccc agggagacag agggaccagt ttggcagctg- atggtggaaa gtggtggagg 360 

cgggggtggc cccccaattt ggctgatccc tcccctccct gtgcctgacc caactgaggt 420 

aggtggggaa cagggcacag gggggccggg gaccccggcc agactgggaa ccagggaggg 480 

55 gatggtccca ttggagcggg gcaaggggcc tggcccacct ccctnccatt gtccttgggc 540 

tgcttancta gctcagctgg aggctcnggt cctgantnaa ngtccctgct gggggccccc * nft 
ccaggtgc 

<210> 6 

60 <211> 397 

<212> DNA 

<213> Homo sapiens o 



600 
608 
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aaagtggtat aanccnnggg gtgccctaat tgnggtganc ttannctc 588 

<210> 73 

<211> 526 

<2X2> DNA 

<213> Homo sapiens 

<400> 73 cn 

cccacttctc gctgctcatg ccgctgggac tggggcggcg gaaaaaggcg ccccctctag &u 

tggaaaatga ggaggctgag ccaggccgtg gagggctggg cgtgggggag ccagggcctc 120 

tgggcggagg tgggtcgggg gnnnaacnnn nnanatnnnn cngnnngnnn nncctnnnnt 180 

tntanangnn ttxinnngata nganctnctn ttangacgag gatnnataat nctaatgcta 240 

naactcctnc tanctngnnn ggaattgatc ntangatggc ntatgcaaat angaagtntc 300 

attctggntt gatgnntggn ggcntaacta nngnattanc angngnnaan tttttctggg 360 

tntnctanga nattnngana aaatannggc ttngnannct anggcnatna nntntnatna 420 

cnananccta nnnngrxnttt ntnnnganaa gtntngnnng gaatgggatt ttgnctgcnt 480 

nngangntan gcntncngng gntnttngac cntttcnnga angaat 526 

<210> 74 

<211> 608 

<212> DNA 

<213> Homo sapiens 

<400> 74 

ggcagattga ttctttatgt tcaagacagc aaattcagat acaaaaaccc accgccatcg *0 

tccctccctc cctcctgctc tgggccaggg atgggcctgg aaggaaaaat tggaggtggg 120 

gaggaggttg cggggttcac agcaaactcn tgtcaaatgc ggaggtaaca ggctncacag 180 

ggagggggct ccfccfceagga ggggtgaggg cattattgca tttgctgggg ggaaggacaa 240 

ccctctcccc tgtattccct gcgtcaggaa actaggaagg ncatgacccc caaacagaac 300 

ccaaggcccc agggagacag agggaccagt: ctggcagctg atggtggaaa gtggtggagg 360 

cgggggtggc cccccaattt ggctgatccc tcccctccct gtgcctgacc caactgaggt 420 

aggtggggaa cagggcacag gggggccggg gaccccggcc agactgggaa ccagggaggg 480 

gatggtccca ttggagcggg gcaaggggcc tggcccacct ccctnccatt gtccttgggc 540 

tgcttancta gctcagctgg aggctcnggt cctgantnaa ngtccctgct gggggccccc 600 
ccaggtgc 

<210> 75 

<211> 891 

<212> DNA 

<213> Homo sapiens 

<400> 75 „ 

gtctggtgcc agcagccgcg gtaattccag ctccaatagc gtatattaaa gttgctgcag eo 

ttaaaaagct cgtagttgga tcttgggagc gggcgggcgg tccgcegcga ggcgagccae 120 

cgcccgtccc cgccccttgc ctctcggcgc cccctcgatg ctcttagctg agtgtcccgc 180 

ggggcccgaa gcgtttactt tgaaaaaatt agagtgfctca aagcaggccc gagccgcctg 240 

gataccgcag ctaggaafcaa tggaatagga ccgcggttct attttgttgg ttttcggaac 300 

tgaggccatg attaagaggg acggccgggg gcattcgtat tgcgccgcta gaggtgaaat 360 

tcttggaccg gcgcaagaog gaccagagcg aaagcatttg ccaagaatgt tttcattaat 420 

caagaacgaa agtcggaggt tcgaagacga tcagataccg tcgtagttcc gaccataaac 480 

gatgccgacc ggcgatgcgg cggcgttatt cccatgaccc gccgggcagc ttccccgaac 540 

cggtgacggt gtcgtggaac taagcccctg accagcggcg tgcacacctt nccgggtgtc 600 

ctacaagtct caggactcta ttcctcanca acgtggtgac cgtgcccttc anagcttggg 660 

cacccaaacc tacatntgca acgtgaatac aaacccagca acaccaangn gggcaagaaa 720 

agttgaaccc caatnttgtg aaaaaaactn acaacatgcc cacccgggcc cagaacctga 780 

antcttgggg gggaccgtaa gttttctttt nccccaaaac ccaaggacac ccttntgatt 840 

tcccggaccc tgaggtacat tcctngtggn ggaccnaacc nccaaaacct t 891 

<210> 76 

<211> 1046 

<212> DNA 

<213> Homo sapiens 

a* 
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<400> 


758 








Asp 


Ser 


Xaa 


Gin 


He 


Gin Cys 


Xaa 


1 








5 






Thr 


Xaa 


Ala 


Lys 


Pro 


Leu He 


Xaa 








20 








Pro 


His 


val 


Thr 


Lys 


Ser Xaa 


Ala 






35 








40 


Asp 


Pro 


Ala 


Gly 


Xaa 


Arg Xaa 


Ser 


50 








55 





His Xaa Asn Asp Thr Ala Thr Phe 

10 15 
Leu Ser Xaa Tyr Val Gin Xaa Gly 
25 30 
Glu Xaa Phe Gly Ser Xaa Asn Val 
45 

Lys Leu Leu Xaa Pro Phe 
60 





<210> 


759 








<211> 


68 








<212> 


PRT 








<213> 


Homo sapiens 






<400> 


759 






Thr 


Xaa Asn 


He 


Pro Phe He 


Ala 


1 






5 




Asn 


Lys Leu 


Leu 


Phe Lys Lys 


Val 






20 






Lys 


Phe His 


val 


He Leu Lys 


Phe 


35 






40 


Thr 


He Xaa 


Xaa 


Xaa Thr Xaa 


Xaa 




50 




55 




Lys 


Xaa Asn 


Ala 






65 











Tyr Val Xaa Tyr Ser Asn Glu Tyr 

10 15 
Arg Xaa Met Lys Ser Leu Leu Xaa 
25 30 
Leu Xaa Ala Asn Lys Ser Xaa Cys 
45 

Lys Xaa His ABp Xaa Phe Phe Cys 
60 



<210> 760 

<211> 91 

<212> PRT 

<213> Homo sapiens 





<400> 


760 








Trp 


Phe 


Arg 


Xaa 


Cys 
5 


Lys 


Ser Ser 


1 
Asp 


Leu 


Xaa 


Thr 


Val 


Ser Leu Cys 






20 








Ala 


Xaa 


Cys 


Leu 


Gin 


Lys 


Asn Xaa 






35 








40 


Gly 


Xaa 


He 


Val 


Gin 


Xaa 


Leu Leu 


50 










55 


Thr 


Trp 


Asn 


Leu 


Xaa 


Lys 


Arg Leu 


65 










70 




Arg 


Ser 


Leu 


Leu 


Tyr 


Ser 


Leu Glu 








85 







Cys 


He 


Val Xaa Met 


Xaa Thr Leu 


10 




15 


His 


Lys 


Val Asn Val 


Net Phe Gin 


25 




30 


Ser 


Xaa 


Ala Phe Xaa 


Xaa Xaa Xaa 






45 




Leu 


Ala 


Xaa Arg Asn 


Phe Lys He 






60 




Phe 


Met: 


Xaa Leu Thr 


Phe Leu Lys 






75 


80 


Tyr 


Xaa 


Thr 





90 



<210> 761 

<211> 46 

<212> PRT 

<213> Homo sapiens 



<400> 761 
His Phe Ser Leu Leu Met Pro Leu 

1 5 
Pro Pro Leu Val Glu Asn Glu Glu 
20 

Gly Val Gly Glu Pro Gly Pro Leu 
35 40 



Gly Leu Gly Arg Arg Lys Lys Ala 
10 15 

Ala Glu Pro Gly Arg Gly Gly Leu 
25 30 
Gly Gly Gly Gly Ser Gly 
45 



<210> 762 
<211> 46 
<212> PRT 

as© 
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<213> Homo sapiens 
<400> 762 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



Pro Asp 


Pro 


Pro 


Pro 


Pro 


Arg 


Gly Pro Gly Ser 


1 






5 






10 


Pro Arg 


Pro 


Gly 


Ser 


Ala 


Ser 


Ser Phe Ser- Thr 




20 








25 


Phe Arg Arg 


Pro 


Ser 


Pro 


Ser 


Gly Met Ser Ser 




35 










40 


<210> 


763 










<211> 


181 










<212> 


PRT 










<213> 


Homo sapiens 




<400> 


763 










Ala Ala 


Gin 


Gly 


Gin 


Trp 


Xaa 


Gly Gly Gly Pro 


1 




5 






10 


Ser Asn 


Gly 


Thr 


lie 


pro 




Leu val Pro ser 




20 








25 


Gly Pro 


Pro 


Val 


Pro 


Cys 


Ser 


Pro Pro Thr Ser 


35 










40 


Gly Arg 


Gly 


Gly 


lie 


Ser 


Gin 


He Gly Gly Pro 


50 










55 




Leu ser 


Thr 


He 


Ser 


Cys 


Gin 


Thr Gly Pro Ser 


65 








70 




75 


Trp Val 


Leu 


Phe 


Gly 


Gly 


His 


Xaa Leu Pro Ser 






85 






90 


He Gin 


Gly 


Arg 


Gly 


Leu 


Ser 


Phe Pro Pro Ala 






100 








105 


Ser Pro 


Leu 


Leu 


Arg 


Gly 


Ala 


flu oci UcU w.*»ta 




115 










120 


Hie Leu 


Thr 


Xaa 


Val 


Cys 


Cys 


Glu Pro Arg Asn 


130 










135 




Asn Phe 


Ser 


Phe 


Gin 


Ala 


His 


Pro Trp Pro Arg 


145 








150 




155 


Gly Arg 


Trp 


Arg 


Trp 


Val 


Phe 


Val Ser Glu Phe 






165 






170 


Lys Asn 


Gin 


Ser 


Ala 










180 










<210> 


764 










<211> 


107 










<212> 


PRT 










<213> 


Homo sapiens 




<400> 


764 










Pro Pro 


Asn 


Arg 


Thr 


Gin 


Gly 


Pro Arg Glu Thr 


1 




5 






10 


Gin Leu 


Met 


Val 


Glu 


Ser 


Gly 


Gly Gly Gly Gly 




20 








25 


Leu He 


Pro 


Pro 


Leu 


Pro 


Val 


Pro Asp Pro Thr 




35 










40 


Gin Gly 


Thr 


Gly 


Gly 


Pro 


Gly 


Thr Pro Ala Arg 


50 










55 




Gly Met 


Val 


Pro 


lieu 


Glu 


Arg 


Gly Lye Gly Pro 


65 








70 




75 


His Cys 


Pro 


Trp 


Ala 


Ala 


Xaa 


Leu Ala Gin Leu 






85 






90 


xaa xaa 


Xaa 


Pro 


Cys 


Trp 


Gly 


Pro Pro Gin Val 






100 








105 


<210> 


765 











15 

Gly Gly Alfi 

30 
Lys Trp 
45 



15 

Leu Ala Gly val Pro 
30 

Val Gly Ser Gly Thr 
45 

Pro Pro Pro Pro Pro 
60 

Val Ser Leu Gly Pro 
60 

Phe Leu Thr Gin Gly 
95 

Asn Ala He Met Pro 
110 

Ser Leu Leu Pro Pro 
125 

Leu Leu Pro Thr Ser 
140 

Ala Gly Gly Arg Glu 
160 

Ala Val Leu Asn He 
175 



15 

Gly Pro Pro He Trj 
30 

Glu Val Gly Gly Git 
45 

Leu Gly Thr Arg Git 
60 

Gly Pro Pro Pro Xac 
80 

Glu Ala Xaa Val Let 
95 
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i i 
<211> 114 
<212> PM 
<213> Homo sapiens 

<400> 765 

Ala Pro Gly Gly Ala Pro Ser Arg Asp Xaa Xaa Ser Gly Xaa Glu Pro 

15 10 15 

Pro Ala Glu Leu Xaa Lys Gin Pro Lys Asp Asn Xaa Arg Glu Val Gly 

20 25 30 

Gin Ala Pro Cys Pro Ala Pro Met Gly Pro Ser Pro Pro Trp Phe Pro 

35 40 45 

Val Trp Pro Gly Ser Pro Ala Pro Leu Cys Pro Val Pro His Leu Pro 

50 55 60 

Gin Leu Gly Gin Ala Gin Gly Gly Glu Gly Ser Ala Lys Leu Gly Gly 
65 70 75 80 

His Pro Arg Leu His His Phe Pro Pro Ser Ala Ala Lys Leu Val Pro 

85 90 95 

Leu Ser Pro Trp Gly Leu Gly Phe Cys Leu Gly Val Met Xaa Phe Leu 
100 105 no 

Val Ser 



<210> 766 
<211> 129 
<212> PUT 

<213> Homo sapiens 



<400> 766 



Ser 


Ser 


Ser 


Asn 


Leu 


Arg 


Leu Ser Phe Leu He 


Asn Glu Asn He Leu 


1 








5 




10 


15 


Gly Lys 


Cys 


Phe 


Arg 


Ser 


Gly Pro Ser Cys Ala 


Gly Pro Arg He Ser 








20 






25 


30 


Pro 


Leu 


Ala 


Ala 


Gin 


Tyr 


Glu Cys Pro Arg Pro 


Ser Leu Leu lie Met 






35 








40 


45 


Ala 


Ser 


Val 


Pro 


Lys 


Thr 


Asn Lys He Glu Pro 


Arg Ser Tyr Ser He 




50 










55 


60 


lie 


Pro 


Ser 


Cys 


Gly 


He 


Gin Ala Ala Arg Ala 


Cys Phe Glu His Ser 


65 










70 


75 


80 


Asn 


Phe 


Phe 


Lys 


Val 


Asn 


Ala Ser Gly Pro Ala 


Gly His Ser Ala Lys 










85 




90 


95 


Ser 


He 


Glu Gly 


Ala 


Pro 


Arg Gly Lys Gly Arg 


Gly Arg Ala Val Ala 








100 






105 


110 


Arg Leu 


Ala 


Ala 


Asp 


Arg 


Pro Pro Ala Pro Lys 


He Gin Leu Arg Ala 






115 








120 


125 



Phe 



<210> 767 
<211> 157 
<212> PRT 

<213> Homo sapiens 

<400> 767 
Lys Net Ala Ala Gly Phe Lys Thr 

1 5 
Arg Phe Leu Lys Glu Asn Cys Arg 
20 

Phe Arg Thr Thr Thr Val Asn He 

35 40 
Ser Ala Leu val Lys Leu Gly Asn 

50 55 
Ala Glu Phe Ala Ala Pro Ser Thr 
65 70 



Val Glu Pro Xaa Glu Tyr Tyr Arg 

10 15 
Pro Asp Gly Arg Glu Leu Gly Glu 
25 30 
Gly Ser lie Ser Thr Ala Asp Gly 
45 

Xaa Thr Xaa He cys Gly val Lys 
60 

Asp Ala Pro Asp Lys Gly Tyr Val 
75 80 



SL8 X 



